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A  large  collection  of  Gram  positive,  spore-forming, 
thermophilic,  cellulolytic  bacilli  were  isolated  from  diverse 
natural  sources.  These  organisms  were  characterized  and  fell 
into  two  groups.  One  group  which  grew  at  55°C  or  lower  was 
identified  as  Bacillus  coagulans .  The  other  group  which  grew 
at  over  65°C  was  identified  as  Bacillus  stearothermophilus . 
Both  groups  of  organisms  represent  a  rich  source  of  bacterial 
genes  encoding  enzymes  for  the  depolymerization  of  cellulose 
to  cellobiose  and  glucose  and  for  the  depolymerization  of 
other  polysaccharides. 

Little  was  previously  known  about  the  mechanisms  of 
cellobiose  transport  and  metabolism  in  Gram  positive  bacteria. 
This  was  extensively  investigated  in  one  of  the  B. 
stearothermophilus  isolates,  strain  XL-65-6.  The  cellobiose 
transport  system  in  this  strain  consisted  of  a  multicomponents 
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phosphoenolpyruvate-dependent  phosphotransferase  system  (PTS)  . 
The  three  genes  encoding  the  cellobiose-specif ic  permease  and 
the  gene  encoding  a  proposed  phospho-6-glucosidase  were  cloned 
and  sequenced.  These  were  found  to  be  organized  as  an  operon 
which  is  present  as  a  single  copy  on  the  B.  stearothermophilus 
chromosome.  The  celB  product  forms  the  major  membrane- 
spanning  channel  (EIICCeL)  for  this  transport  system.  Products 
of  celA  and  celD  are  cytoplasmic  components  of  enzyme  II 
(EIIBCel  and  EIIACel)  .  The  celC  gene  encodes  the  glycohydrolase 
which  serves  as  a  cleavage  enzyme.  It  is  likely  that  this 
enzyme  is  phospho-6-D-glucopyranosidase.  These  genes  were 
functionally  expressed  in  recombinant  Escherichia  coli. 

Genes  (ptsH  and  ptsl)  encoding  the  PTS  general  proteins, 
HPr  and  enzyme  I,  were  also  cloned  and  sequenced  from  strain 
XL-65-6.  These  two  genes  together  with  a  third,  ptsT,  were 
also  organized  as  an  operon.  This  ptsHIT  operon  exists  as  a 
single  copy  in  the  B.  stearothermophilus  chromosome.  Deduced 
amino  acid  sequences  of  B.  stearothermophilus  ptsH  and  ptsl 
are  most  similar  to  those  from  B.  subtilis  although  they 
exhibit  a  high  degree  of  similarity  with  homologues  from  many 
other  bacteria.  No  gene  similar  to  ptsT  was  found  in  the 
current  GenBank  data  base.  The  exact  function  of  ptsT  remains 
undetermined. 

Results  from  these  studies  provided  the  basis  for  a  model 
of  cellobiose  transport  and  metabolism  in  B. 
stearothermophilus . 
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CHAPTER  1 
INTRODUCTION 


Thermophiles  are  a  fascinating  group  of  microorganisms 
which  have  received  considerable  interest  in  recent  years 
because  of  their  potential  for  biotechnological  applications. 
Research  on  thermophiles  is  expanding  into  many  different 
fields  including  biochemistry,  enzymology,  genetics,  and 
molecular  biology.  The  biotechnological  interest  will  further 
stimulate  basic  research  on  this  group  of  microorganism  and 
increase  our  understanding  of  thermophiles  in  the  future 
(Kristjansson  and  Stetter,  1992) . 

The  genus  Bacillus  is  one  of  the  earliest  bacteria  to  be 
described.  Species  within  this  genus  have  been  important  as 
models  for  studying  biological  problems,  including 
differentiation,  secretion,  DNA  replication,  DNA  repair, 
chemotaxis,  genetic  transformation,  and  translation  apparatus 
(Skepecky  and  Hemphill,  1992).  They  have  also  been  utilized 
in  a  wide  range  of  industrial  processes  due  to  metabolic 
diversity  and  low  reported  incidence  of  pathogenicity 
(Harwood,  1989). 

Interest  in  cellulose  decomposition  and  in  the 
microorganisms  that  produce  enzyme  systems  for  cellulose 
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2 
depolymerization  have  been  stimulated  by  the  perceived  need  to 
exploit  cellulose  as  a  renewable  source  of  energy  and  chemical 
feedstocks,  and  to  improve  the  efficiency  of  digestion  of 
fodder  by  ruminants  (Coughlan  and  Mayer,  1992) .  Cellulose  is 
the  most  abundant  carbohydrate  in  nature.  It  has  been 
estimated  that  7  x  1011  metric  tons  of  cellulose  exist  in 
primary  sources  at  any  one  time,  and  that  approximately  4  x 
1010  tons  are  replenished  each  year  by  photosynthesis 
(Coughlan,  1985) .  Unlike  fossil  fuels,  cellulose  is  clearly 
a  vast  and  renewable  resource.  In  addition  to  primary 
sources,  cellulose  also  represents  approximately  50%  of  the 
dry  weight  of  many  secondary  sources  of  biomass  such  as 
agricultural,  forest,  industrial,  and  domestic  waste.  Huge 
amounts  of  these  materials  are  also  generated  each  year 
(Coughlan  and  Mayer,  1992) . 

Cellobiose  is  one  of  the  primary  soluble  extracellular 
products  of  microbial  cellulose  degradation.  Despite  the 
abundance  of  cellobiose  in  the  environment,  little  is  known 
about  its  entry  into  cells  and  its  intracellular  metabolism. 
Many  Gram  positive  and  negative  bacteria  have  been  reported  to 
contain  cell-associated  6-glucosidases  (Coughlan  and  Mayer, 
1992) .  Cellulomonas  species  have  cellobiose  phosphorylases 
which  are  presumed  to  be  involved  in  intracellular  cellobiose 
metabolism  (Coughlan  and  Mayer,  1992).  In  E.  coli,  two 
cryptic  operons  have  been  described  at  a  molecular  level  which 
provide  the  potential  to  transport  cellobiose  into  the  cell  by 
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a  phosphoenolpyruvate-dependent  phosphotransferase  system 
(PTS)  and  to  cleave  cellobiose  into  monomer  (Hall  and  Xu, 
1992;  Parker  and  Hall,  1990).  Both  operons  have  different 
gene  organizations  but  potentially  encode  a  cellobiose- 
specific  enzyme  II  complex  and  a  phospho-6-glucosidase.  This 
is  the  only  organism  for  which  cellobiose  uptake  has  been 
defined.  But  this  organism  typically  does  not  have  a 
functional  cellobiose  transport  system. 

Different  organisms  often  transport  the  same  sugars  by 
alternative  mechanisms.  For  example,  E.  coli  and  Bacillus 
species  use  non-PTS  to  take  up  lactose  into  the  cell  while 
Staphylococcus  and  Streptococcus  species  initiate  the 
metabolism  of  this  sugar  by  the  PTS  (Saier  and  Yamada,  1987). 
However,  no  information  is  available  about  the  uptake  of 
cellobiose  in  any  Gram  positive  bacteria. 

PTS  is  a  complex  system  which  is  used  to  transport  a 
variety  of  sugars  in  different  bacteria.  This  system  consists 
of  several  interacting  proteins,  including  cytoplasmic  and 
membrane  proteins  (Postma  et  al.,  1993).  In  addition  to  its 
primary  function  of  carbohydrate  translocation,  the  PTS  also 
has  a  variety  of  other  important  physiological  functions  in 
bacterial  cells.  One  such  function  is  to  regulate  the  order 
of  sugar  utilization  when  mixtures  of  sugars  are  present.  For 
example,  in  media  containing  sugars  transported  by  the  PTS 
(PTS-sugar)  and  sugars  transported  by  other  system  (non-PTS 
sugar) ,  PTS-sugars  are  utilized  prior  to  induction  of  the 
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catabolic  systems  for  remaining  sugars  (diauxic  growth) .  The 
PTS  can  also  function  as  a  signal  transduction  system  through 
which  bacteria  respond  to  the  presence  of  PTS-sugars  by 
positive  chemotaxis.  The  PTS  may  also  serve  as  a  link  between 
carbon  and  nitrogen  metabolism  in  Gram  positive  bacteria.  One 
domain  of  the  B.  subtilis  levR  gene  product,  a  protein 
involved  in  the  regulation  of  PTS  levanase  operon,  is 
homologous  to  a  part  of  NifA  and  NtrC  proteins,  which  are 
activators  of  <j54-dependent  transcription  (Debarbouille  et  al.  , 
1991)  .  a54  is  required  for  the  transcription  of  a  number  of 
genes  important  for  glutamate  production,  especially  under  the 
conditions  of  nitrogen  limitation. 

The  primary  aim  of  this  dissertation  is  to  investigate 
the  genetic  organization  of  cellobiose  uptake  system  in  a 
thermophilic  species  of  Bacillus.  The  specific  objectives 
were  as  follows: 

1.  To  isolate  and  identify  thermophilic  cellulolytic  Bacillus 

strains  from  nature. 

2.  To  characterize  cellobiose  uptake  genes  at  a  molecular 
level  from  one  isolate. 

3.  To  construct  cellobiose-utilizing  strains  of  E.    coli   by 
introducing  the  Bacillus   genes  into  E.    coli. 


CHAPTER  2 
REVIEW  OF  LITERATURE 


Thermophilic  Microorganisms 

Definition  of  Thermophilic  Microorganisms 

Temperature  is  one  of  the  most  important  variables  in  our 
environment.  The  classification  of  living  organisms  based  on 
their  relation  to  temperature  has  therefore  always  been 
considered  as  one  of  the  most  basic  elements  of  biological 
systematics.  Microorganisms  are  frequently  divided  into  three 
broad  classes  on  the  basis  of  temperature  ranges  for  growth, 
that  is  psychrophiles,  mesophiles,  and  thermophiles  (Neidhardt 
et  al.,  1990).  Brock  (1986)  has  suggested  a  definition  of  a 
thermophile  boundary  at  55°C  to  60°C  based  on  two  main 
arguments:  first,  temperatures  lower  than  50°C  are  widespread 
on  earth,  whereas  temperatures  greater  than  55°C  to  60°C  are 
much  rarer  in  nature.  Second,  60°C  is  the  upper  temperature 
limit  for  eucaryotlc  life.  However,  bacteria  that  grow  at 
50°C  or  above  are  traditionally  called  thermophiles 
(Kristjansson  and  Stetter,  1992;  Neidhardt  et  al.,  1990). 

The  upper  temperatures  for  different  thermophilic 
bacteria  vary  considerably.  In  general,  nonphotosynthetic 
organisms  are  able  to  grow  at  higher  temperatures  than 
photosynthetic  forms;  structurally  less  complex  organisms  can 
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6 
grow  at  higher  temperatures  than  more  complex  organisms.  The 
upper  growth  temperature  for  cyanobacteria  and  other 
phototrophic  bacteria  is  70°C  to  73°C,  for  organotrophic 
bacteria  is  over  90°C,  while  archaebacteria  can  grow  at  over 
100°C.  However,  not  all  organisms  from  these  groups  are 
thermophiles.  Usually  only  a  few  species  or  genera  are  able 
to  function  successfully  near  the  upper  temperature  limits 
(Brock  and  Madigan,  1991) . 
General  Physiological  Characteristics  of  Thermophiles 

Thermophilic  bacteria  exhibit  a  wide  range  of 
capabilities  for  energy  and  nutrition  metabolism:  phototrophy 
and  chemotrophy,  autotrophy  and  heterotrophy,  aerobiosis  and 
anaerobiosis.  Examples  of  thermophiles  capable  of 
phototrophic  growth  include  16  cyanobacteria  species  and  some 
purple  and  green  bacteria  (Brock  and  Madigan,  1991) .  Although 
these  organisms  use  light  as  an  energy  source,  most 
thermophiles  are  chemotrophs,  using  either  inorganic  or 
organic  chemicals  as  energy  sources.  Thermophiles  that  grow 
autotrophically  include  many  of  the  methanogens ,  Bacillus 
schlegelii  and  Clostridium  thermoautotrophicum.  However,  like 
the  majority  of  thermophilic  bacteria,  these  organisms  also 
grow  heterotrophically  (Cato  et  al.,  1986;  Claus  and  Berkeley, 
1986) .  Many  thermophiles,  including  most  of  the  thermophilic 
members  of  the  genera  Bacillus ,  Clostridium,  Thermomicrobium , 
and  Thermus  only  grow  heterotrophically.  Heterotrophic 
thermophiles  utilize  a  wide  range  of  carbon  sources  for 
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growth,  including  carbohydrates,  protein,  lipids,  organic 
acids,  alcohols,  aromatic  compounds  as  well  as  many  other 
compounds  (Sundaram,  1986) .  Thermophilic  Bacillus  strains 
have  been  isolated  which  are  able  to  degrade  methanol 
(Dijkhuizen  et  al.,  1988;  Al-Awadhi  et  al.,  1989),  ethanol 
(Al-Awadhi  et  al.,  1989),  phenol,  and  cresol  (Buswell  and 
Twomey,  1975;  Buswell,  1975).  Examples  of  aerobic 
thermophiles  include  the  genus  Thermus  (Williams,  1992)  ,  and 
thermophilic  species  in  the  genus  Bacillus  (Claus  and 
Berkeley,  1986) .  For  these  organisms,  oxygen  is  involved  in 
the  energy-generating  step,  serving  as  a  terminal  electron 
acceptor.  In  contrast  to  this  group,  some  thermophiles,  such 
as  thermophilic  species  in  the  genus  Clostridium,  are 
anaerobes.  These  organisms  produce  energy  through  anaerobic 
fermentative  pathways. 
Adaptation  of  Thermophiles  to  High  Temperature 

Considerable  information  has  been  obtained  about  the 
ecology,  physiology,  metabolic  capacities,  and  biochemical  and 
physical  properties  of  cellular  components  of  thermophiles. 
It  is  clear  that  the  ability  to  grow  at  high  temperature 
cannot  be  explained  by  a  single  mechanism  but,  rather,  that 
thermophiles  appear  to  use  a  variety  of  mechanisms  (Welker, 
1993) . 

The  content  of  guanine  and  cytosine  (G+C)  often 
correlates  with  maximum  growth  temperature,  although 
Clostridium   species  are  an  exception  (Welker,  1976) .   The  DNA 
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of  thermophiles  showed  a  consistently  higher  G+C  content  than 
that  of  DNA  from  mesophiles.  Thermal  melting  profiles 
demonstrated  that  DNA  from  thermophiles  had  higher  melting 
temperatures  than  DNA  from  mesophiles.  The  greater  stability 
to  thermally- induced  strand  separation  is  attributable  to  the 
more  extensive  hydrogen  bonding  that  occurs  with  a  higher  G+C 
content. 

Stability  of  the  cellular  membrane  is  a  determinant  of 
upper  temperature  limit  for  growth  of  organisms  (Sundaram, 
1986) .  The  structure  of  cell  membrane  of  thermophilic 
bacteria,  like  those  of  other  biological  membranes,  conforms 
to  the  model  of  a  lipid  bilayer.  However,  the  lipids  in 
thermophiles  tend  to  have  a  greater  abundance  of  fatty  acids 
with  higher  melting  points  than  do  the  lipids  of  mesophiles. 
A  significant  property  of  the  phospholipid  bilayer  of  the 
bacterial  membrane  is  that  it  can  undergo  thermotropic, 
reversible  transition  between  an  ordered,  rigid  gel  or  solid 
phase  and  a  more  fluid  liquid-crystalline  phase.  The  change 
from  gel  to  liquid-crystalline  phase  involves  the  cooperative 
melting  of  the  hydrocarbon  chains  in  the  interior  of  the 
bilayer,  and  this  phase  transition  behavior  may  set  the 
minimal  and  maximal  temperatures  for  growth  (Sundaram,  1986) . 
The  stability  of  proteins  also  determine  the  maximal  growth 
temperature  (Sundaram,  1986) .  A  large  number  of  proteins  from 
thermophiles  have  been  isolated  in  pure  form.  Generally, 
these  proteins  are  more  thermostable  than  their  homologous 
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counterparts  from  mesophiles  (Sundaram,  1986) .  Amelunxen  and 
Lins  (1968)  compared  the  thermostability  of  11  enzymes  from 
the  thermophile  Bacillus  stearothermophilus  and  the  mesophile 
Bacillus  cereus.  They  found  that  nine  of  these  enzymes  from 
B.  stearothermophilus  were  more  thermostable  than  the 
corresponding  enzymes  from  B.  cereus.  However,  pyruvate 
kinase  and  glutamate-oxaloacetate  transaminase  from  both 
bacteria  exhibited  similar  rates  of  inactivation  at  70°C. 
Thermophilic  adaptation  has  been  correlated  with 
preferential  alterations  of  the  amino  acid  composition  in 
terms  of  'traffic  rules'  governing  gross  amino  acid 
compositions  of  mesophilic  and  thermophilic  proteins.  Mozhaev 
and  Martinek  (1984)  concluded  that  many  thermophilic  proteins 
are  deficient  in  polar  amino  acids,  mostly  serine  and 
threonine.  Substitution  of  the  inner  serine  and  threonine  for 
nonpolar  amino  acid  residues  was  proposed  as  a  mechanism  to 
stabilize  proteins  to  high  temperature.  DiRuggiero  et  al. 
(1993)  reported  that  thermostable  glutamate  dehydrogenase 
(GDH)  from  hyperthermophilic  Archaeon  ES4  (optimal  growth 
temperature  98°C  and  maximum  growth  temperature  110°C)  had  a 
relatively  high  hydrophobicity  and  a  low  number  of  sulfur- 
containing  residues  compared  with  mesophilic  GDHs.  However, 
after  statistically  comparing  dihydrofolate  reductases  from  22 
different  organisms,  Bohm  and  Jaenicke  (1994)  concluded  that 
simple  'traffic  rules'  were  insufficient  to  explain  the 
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mechanisms  of  thermophilic  adaptation  or  to  predict  the 
stabilization  or  destabilization  of  proteins. 

Indeed  proteins  from  thermophiles  are  rather  similar  to 
their  mesophilic  counterparts  in  most  respects,  including 
size,  subunit  structure,  gross  higher  order  structural 
parameters,  and  modulation  of  activity  by  metal  ions  and  other 
effectors  (Amelunxen  and  Murdock,  1978)  .  Attempts  to  transfer 
heat  stability  and  heat  lability  by  mixing  cell-free  extracts 
from  thermophiles  and  mesophiles  failed  to  change  the  original 
properties  of  individual  enzymes  (Amelunxen  and  Lins,  1968; 
Koff ler  and  Gale,  1957) .  These  results  established  that  the 
functioning  of  proteins  at  high  growth  temperatures  of 
thermophiles  is  not  merely  dependent  on  the  presence  of 
stabilizing  factors  in  the  cellular  environment  of  the 
thermophile  and  that  intrinsic  structural  stability  of 
proteins  is  the  more  important  basis  for  thermophily 
(Sundaram,  1986) . 

Amelunxen  and  Murdock  (1978)  have  suggested  that  the 
structural  stabilization  of  thermophilic  proteins  is  primarily 
achieved  through  extra  hydrogen  bonds,  ionic  interactions  and 
apolar  interactions  in  certain  critical  parts  of  the  protein 
molecules.  After  comparing  the  structures  of  triosephosphate 
isomerase  from  thermophilic  bacteria,  Moraxella  sp.  ,  and 
mesophilic  chicken,  Rentier-Delrue  et  al.  (1993)  concluded 
that  the  adaptability  of  this  enzyme  to  high  temperature 
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appeared  to  be  favored  by  better  stabilizing  residues  for  the 
helix  dipole  as  well  as  better  helix-forming  residues. 

Also,  there  is  evidence  that  thermostability  of  a  protein 
can  be  altered  by  a  single  amino  acid  change.  The 
thermostability  of  E.  coli  ribonuclease  HI  was  increased  by 
replacement  of  single  amino  acid  residues  (Ishikawa  et  al. 
1993).  Crystal  structures  of  these  mutant  proteins  revealed 
that  only  changes  in  the  local  structure  around  each  mutation 
site  are  essential  for  the  increase  in  thermostability.  After 
His65  was  replaced  by  Pro,  the  mutant  protein  was  stabilized 
because  of  a  decrease  in  the  entropy  of  the  unfolded  state, 
without  a  change  in  the  native  backbone  structure.  After 
Lys95  was  replaced  by  Gly,  the  left-handed  backbone  structure 
in  the  typical  3:5  type  loop  was  eliminated.  The  mutant 
protein  with  a  replacement  of  Lys95  by  Asn  was  stabilized  due 
to  formation  of  a  hydrogen  bond  between  the  side-chain  N5-atom 
of  the  asparagine  residue  and  the  main-chain  carboxyl  oxygen 
within  the  same  residue. 

Inherent  structural  stability  alone  may  not  be  sufficient 
to  ensure  the  survival  and  functioning  of  proteins  in 
thermophilic  cells.  Thermophilic  proteins  display  a  rather 
broad  spectrum  of  stability  in  vitro,  suggesting  that  they  are 
not  all  equally  stable.  Many  enzymes  from  thermophiles  are 
stabilized  in  vitro  by  supplements,  such  as  substrates,  metal 
ions,  allosteric  effectors,  and  electrolytes,  all  of  which  are 
present  in  the  growing  bacterial  cells  (Sundaram,  1986)  . 
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Glutamine  synthetase  from  Bacillus  caldolytics  was 
demonstrated  to  be  stabilized  by  the  binding  of  metal  ions  and 
substrate  (Merkler  et  al.,  1988).  In  vitro  stability  of 
glyceraldehyde-3-phosphate  dehydrogenase  from  Bacillus 
coagulans  was  enhanced  by  increasing  the  ionic  strength  to  1.8 
M  with  many  neutral  salts  (McLinden  et  al.,  1986).  Calcium 
ions  confer  stability  to  several  extracellular  enzymes,  such 
as  a-amylase  (Yutani,  1976),  thermolysin  (Tajima  et  al., 
1976),  and  proteases  (Sidler  and  Zuber,  1977). 

Insights  into  the  molecular  mechanisms  of  thermophily 
have  been  hampered  because  little  is  known  about  the  genetic 
and  molecular  basis  of  thermophily.  This  gap  is  directly 
related  to  lack  of  a  genetic  exchange  systems  for  the 
manipulation  and  genetic  analysis  of  most  thermophilic  species 
(Welker,  1993) .  Lindsay  and  Creaser  (1975)  reported  that 
after  a  mesophilic  B.  subtilis  strain  was  transformed  using 
DNA  isolated  from  a  obligately  thermophilic  B.  caldolyticus 
strain,  the  resulting  recombinant  of  B.  subtilis  strain  was 
able  to  grow  above  70°C.  These  investigators  suggested  that 
thermophily  was  controlled  by  a  small  number  of  genes  acting 
at  the  ribosomal  or  tRNA  level  affecting  translation  resulting 
in  expression  of  more  thermostable  enzyme  variants.  An  a- 
glucosidase  purified  from  the  high-temperature-growth 
transformant  had  been  confirmed  to  have  changes  in  the  amino 
acid  composition.  Although  this  enzyme  had  much  higher 
optimum  growth  temperature  than  its  counterpart  in  its 
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mesophilic  parent,  many  physiological  characteristics  remained 
unchanged  (Krohn  and  Lindsay,  1991) .  Similar  results  have 
been  reported  by  Droffner  and  Yamamoto  (1985) .  Most  recently, 
Stahl  (1991)  isolated  3  plasmids  from  thermophilic  B. 
stearothermophilus ,  which  could  be  involved  in  thermophily 
since  strains  cured  of  these  plasmids  were  no  longer  obligate 
thermophiles.  Furthermore,  when  these  plasmids  were 
transformed  into  a  mesophilic  B.  subtilis  strain,  the 
resulting  transformants  shifted  their  optimum  growth 
temperature  about  10°C  higher,  and  about  6°C  higher  for  the 
maximum  growth  temperature.  Based  on  these  experiments,  Stahl 
concluded  that  Lindsay  and  Creaser's  results  (1975)  might  be 
due  to  the  presence  in  the  donor  strain  of  plasmids  involved 
in  thermophily. 
Evolution  of  Thermophilic  Microorganisms 

Thermophilic  bacteria  are  distributed  among  all  main 
groups  in  bacterial  genetic  tree.  However,  it  is  interesting 
to  note  that  they  are  usually  the  oldest  types  in  their 
respective  groups  (for  example  Thermotoga,  Thermomicrobium , 
Chloroflexus,  and  Thermous)  ,  and  even  the  oldest  of  the 
bacterial  kingdom.  These  results  suggest  that  thermophily 
arose  very  early  in  the  evolution  of  bacteria.  Indeed,  the 
ancestral  bacterium  might  have  been  a  thermophile 
(Kristjansson  and  Stetter,  1992) . 
Applications  of  Thermophilic  Microorganisms 

Thermophiles    offer    some   major    advantages    for 
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biotechnology.  The  most  attractive  attribute  of  thermophiles, 
from  a  biotechnological  point  of  view,  is  that  they  produce 
enzymes  capable  of  catalyzing  biochemical  reactions  at 
temperatures  markedly  higher  than  those  of  conventional 
organisms.  The  main  advantages  of  increased  temperature  are 
generally  higher  reaction  rates,  higher  solubility  of  most 
chemicals,  and  increased  fluidity  and  diffusion  rates.  High 
temperature  also  prevents  contamination  from  mesophilic 
microorganisms.  In  addition,  many  enzymes  from  thermophiles 
are  more  stable  at  conventional  temperatures,  thus  prolonging 
the  shelf  life  of  commercial  products  (Brock,  1986) .  Interest 
in  thermostable  enzymes  from  thermophiles  has  grown.  The 
variety  of  thermostable  enzymes  used  in  industrial  application 
has  been  steadily  increasing,  mainly  as  replacements  for 
thermolabile  enzymes  in  existing  processes.  For  example, 
thermostable  amylase  from  B.  licheniformis  and  B. 
stearothermophilus  have  replaced  thermolabile  amylase  from  B. 
subtilis  (Zamost  et  al.,  1991).  The  main  applications  for 
thermostable  enzymes  have  been  starch  liquif ication  using 
amylases  and  proteases  for  food  processing  and  detergents 
(Zamost  et  al.,  1991) .  Other  thermostable  enzymes  with  a  wide 
range  of  characteristics  remain  to  be  exploited  and  applied. 
The  application  of  thermophiles  is  not  limited  to  industrial 
enzyme  processes.  There  exist  many  other  potential 
applications,  such  as  fermentation,  waste  treatment,  and 
microbial  leaching  (Brock,  1986) . 
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Thermophilic  Bacillus 

In  Bergey's  Manual  of  Systematic  Bacteriology  (Claus  and 
Berkeley,  1986) ,  the  thermophilic,  aerobic  or  facultative 
anaerobic,  spore-forming,  Gram  positive  rods  are  placed  in  the 
species    B.    acidocaldarius,  B.  schlegelii,  B. 

stearothermophilus,  B.  coagulans .  and  B.  licheniformis.  The 
first  three  species  are  able  to  grow  at  65°C  or  higher,  and 
the  later  two  are  able  to  grow  at  55°C.  The  differential 
properties  of  these  organisms  are  listed  in  Table  1. 

B.  acidocaldarius  was  first  isolated  by  Darland  and  Brock 
(1971)  from  hot  springs  in  Yellowstone  and  from  fumaroles  in 
Hawaii.  This  organism  appears  to  be  restricted  to  thermal 
springs  and  the  surrounding  soils  (Priest,  1989) .  It  has  a 
temperature  range  between  45°C  and  70°C  and  a  pH  range  between 
pH  2  and  pH  6  (Darland  and  Brock,  1971) .  One  of  the  unique 
features  of  B.  acidocaldarius  is  its  fatty  acid  profile.  Like 
other  Bacillus  spp.,  it  possesses  MK-7  as  a  major  menaquinone 
component,  but  in  addition  it  also  possesses  the  unique  fatty 
acid  u-cyclohexane  (Minnikin  and  Goodfellow,  1981) . 

B.  schlegelii  is  a  thermophilic  hydrogen-oxidizing 
organism.  It  was  isolated  by  Aragno  (1978)  from  the 
superficial  layer  of  the  sediment  of  a  small  eutrophic  lake  in 
Switzerland  and  was  described  by  Schenk  and  Aragno  (1979)  .  It 
is  an  obligate  aerobe.  The  Gram  reaction  is  variable,  but 
cell  wall  structure  of  this  organism  is  typical  of  a  Gram 
positive  bacteria  (Aragno,  1992)  .  The  optimal  temperature  for 
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Table  l.  Differential  Properties  of  Thermophilic  Bacillus 


Properties8 


Bco 


Bli 


Bac 


Bsc 


Bst 


Growth  at 
3  0°C 
55°C 
65°C 

+ 
+ 

+ 
+ 

+ 
+ 

Growth  in 
5%  NaCl 
7%  NaCl 

- 

+ 
+ 

ND 
ND 

Acid  from 
D-Glucose 

+ 

+ 

+ 

Hydrolysis  of 
Casein 
Gelatin 
Starch 

d 

+ 

+ 
+ 
+ 

ND 
ND 

+ 

Growth  at 
pH  6.8 

+ 

+ 

- 

Autotrophic  with 
H2  +  C02  or  CO 

- 

_ 

_ 

d 

+ 
+ 


Source:  Claus  and  Berkeley,  1986 

Note:  Bco:  Bacillus  coagulans 

Bli:  Bacillus  licheniformis 

Bac:  Bacillus  acidocaldarius 

Bsc:  Bacillus  schlegelii 

Bst:  Bacillus  stearothermophilus 

+  :  90%  or  more  strains  are  positive 
-:  90%  or  more  strains  are  negative 
d:  11-89%  of  strains  are  positive 
ND:  no  data  available 


17 
growth  is  about  70°C,  with  no  growth  at  37°C  or  80°C.  The 
optimal  pH  for  growth  is  between  pH  6  to  pH  7  (Schenk  and 
Aragno,  1979).  B.  schlegelii  is  facultatively 
chemolithotrophic  using  either  H2  as  the  electron  donor  and  C02 
as  the  carbon  source  or  CO  to  serve  both  requirements  (Claus 
and  Berkeley,  1986) . 

B.  stearothermophilus  is  a  highly  heterogeneous  species. 
Wolf  and  Sharp  (1981)  subdivided  this  species  into  three 
groups,  while  Bartolomeo  et  al.  (1991)  clustered  this  species 
into  four  groups  on  the  basis  of  morphological,  physiological, 
and  biochemical  characteristics  as  well  as  the  DNA  base 
compositions  of  133  thermophilic  Bacillus   strains. 

B.  coagulans  is  an  important  food  spoilage  agent  involved 
in  the  coagulation  of  canned  milk  and  flat  souring  of 
carbohydrate  containing  canned  foods  through  the  production  of 
high  concentration  of  L-(+) -lactic  acid  (Frazier,  1958).  It 
was  originally  isolated  from  spoiled  canned  milk  by  Hammer  in 
1915.  This  organism  is  considered  to  be  a  facultative 
thermophile,  growing  well  at  45°C  to  55°C  with  some  strains 
growing  up  to  65°C  (Wolf  and  Sharp,  1981) .  Early  studies 
indicated  considerable  heterogeneity  among  members  of  this 
species  with  variations  in  morphology  of  both  cells  and  spores 
(Smith  et  al.,  1952;  Sei  et  al.,  1978).  Wolf  and  Barker 
(1968)  divided  this  species  into  two  distinct  physiological 
types.   On  the  basis  of  a  more  recent  study  by  Bartolomeo  et 
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al.  (1991),  this  species  was  further  divided  into  three 
groups. 

Although  some  investigators  have  proposed  species  names 
for  certain  thermophilic  Bacillus  strains,  These  strains  have 
not  been  recognized  as  species  in  Bergey's  Manual  (Claus  and 
Berkeley,  1986) . 

Bacterial  Cellulase  System 

A  complete  cellulase  system  is  defined  as  that  which  can 
catalyze  extensive  hydrolysis  of  crystalline  cellulose 
(Coughlan  and  Mayer,  1992).  Such  systems  require  the 
concerted  action  of  several  different  types  of  enzymes,  among 
which  mainly  include:  (a)  endoglucanases  (EC  3.2.1.4,  CMCase) , 
(b)  cellobiohydrolases  (EC  3.2.1.91,  exoglucanase) ,  and  (c)  B- 
glucosidases  (EC  3.2.1.21).  The  endoglucanase  cleaves  B- 
glucosidic  bonds  randomly  in  the  internal  region  of  the 
cellulose  molecules.  Typically,  these  enzymes  are  inactive 
against  crystalline  cellulose  although  they  may  yield  a  small 
amount  of  reducing  sugar  by  hydrolysis  of  the  amorphous 
regions  of  crystalline  substrates.  This  group  of  enzymes, 
generally,  is  active  against  acid-swollen  amorphous  cellulose, 
soluble  derivatives  of  cellulose  such  as  carboxyl  methyl 
cellulose  (CMC),  and  eel lool igosacchar ides . 
Cellobiohydrolases  attack  cellulose  molecules  stepwise  from 
the  nonreducing  ends,  liberating  soluble  cellobiose  units. 
These  enzymes  can  act,  albeit  slowly,  against  crystalline 
substrates  such  as  Avicel.   They  are  also  active  against 
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amorphous  celluloses  and  cellooligosaccharides,  but  are 
inactive  against  cellobiose  or  substituted  soluble  celluloses 
such  as  CMC.  The  6-glucosidases  hydrolyze  cellobiose  to 
glucose  and  catalyze  the  removal  of  glucose  from  the  low 
molecular  weight  cellooligosaccharides,  but  they  are  inactive 
against  crystalline  or  amorphous  cellulose  (Coughlan  and 
Mayer,  1992;  Wood  and  Bhat,  1988).  In  addition  to  the  above 
three  major  types  of  cellulases,  many  other  types  of  enzymes, 
such  as  phospho-6-glucosidase,  cellobiose  phosphorylase, 
cellodextrin  phosphorylase,  may  also  play  roles  in  various 
organisms  (Coughlan  and  Mayer,  1992;  El  Hassouni  et  al., 
1992)  . 

At  least  35  bacterial  genera  have  been  reported  to 
contain  cellulolytic  species,  many  of  which  can  grow  on 
cellulose  and  produce  enzymes  that  catalyze  the  degradation  of 
soluble  derivatives  of  cellulose  or  the  amorphous  regions  of 
otherwise  crystalline  cellulose.  However,  very  few  bacteria 
synthesize  the  complete  enzyme  system  needed  for  the  extensive 
hydrolysis  of  the  crystalline  cellulose  in  nature  (Coughan  and 
Mayer,  1992) . 

Sugar  Transport  Systems  in  Bacteria 

Bacteria  employ  various  transport  systems  for  uptake  of 
different  sugars.  In  E.  coli,  almost  all  of  the  transport 
mechanisms  (including  facilitated  diffusion,  shock-sensitive 
active  transport,  and  secondary  active  transport)  participate 
in  the  uptake  of  one  or  more  sugars  or  sugar  alcohols 
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(Neidhardt  et  al.,  1990).  Glycerol  enters  E.  coli  by 
facilitated  diffusion  with  the  aid  of  a  glycerol-specif ic 
transmembrane  protein.  This  mechanism  does  not  concentrate 
the  substrate  within  the  cytoplasm.  Maltose  enters  E.  coli  by 
a  shock-sensitive  system.  This  system  is  made  up  of  four 
distinct  proteins,  a  binding  protein,  two  transmembrane 
proteins,  and  an  energy-transducing  protein.  When  maltose 
enters  the  periplasm,  it  binds  tightly  to  the  binding  protein; 
the  binding  protein  then  passes  maltose  to  sites  on  the 
transmembrane  proteins;  the  energy-transducing  protein 
hydrolyze  a  high-energy  phosphate  bond  of  ATP,  the  energy  of 
which  changes  the  conformation  of  transmembrane  proteins  such 
that  the  bound  maltose  is  released  on  the  membrane's  inner 
surface.  Lactose  enters  E.  coli  by  a  proton  symport  and 
melibiose  by  a  sodium  symport.  Symport  is  the  transport  of 
two  substrates  simultaneously  in  the  same  direction  by  a 
single  carrier:  if  one  of  these  substrates  flows  down  its 
concentration  gradient,  the  other  flows  with  it.  Glucose 
enters  E.  coli  by  a  phosphoenolpyruvate-dependent 
phosphotransferase  system  (PTS) ,  which  will  be  described  in 
more  detail  in  next  section.  Interestingly,  the  uptake  of  a 
particular  sugar  does  not  follow  a  pattern:  for  example 
lactose  is  taken  into  E.  coli  by  proton  symport  but  into 
Staphylococcus  aureus   by  a  PTS. 

Phosphoenolpyruvate-Dependent  Phosphotransferase  System 
Bacteria  take  up  many  sugars  into  the  cell  by  a  complex 


21 
system,  phosphoenolpyruvate-dependent  phosphotransferase 
system  (PTS) .  This  system  consists  of  several  interacting 
proteins  and  has  a  variety  of  physiological  functions  in 
bacterial  cells  in  addition  to  sugar  transport  (Postma  et  al., 
1993)  . 

The  PTS  was  discovered  in  cell  extracts  of  E.  coli  by 
Kundig  et  al.  (1964)  as  a  system  which  catalyzes  the 
phosphorylation  of  a  number  of  carbohydrates  with  PEP  as  the 
phosphoryl  donor.  This  system  consists  of  two  general  energy- 
coupling  proteins,  enzyme  I  (EI)  and  HPr,  as  well  as  a  sugar- 
specific  permease,  commonly  referred  as  the  enzyme  II  complex 
(EII)  (Saier  and  Reizer,  1992) .  The  general  reaction  scheme 
of  the  system  is  illustrated  as  following  (Postma  et  al., 
1993;  Saier  and  Reizer,  1992): 

P-enolpyruvate  +  EI  «•  P-EI  +  pyruvate       (1) 

P-EI  +  HPr  -  P-HPr  +  EI       (2) 

P-HPr  +  EIIA (domain  or  protein)  -  P-EIIA  +  HPr       (3) 

P-EIIA  +  EIIB  (domain  or  protein)  -  P-EIIB  +  EIIA       (4) 

P-EIIB  +  EIIC  (domain  or  protein)  *»  P-EIIB  +  EIIA       (5) 

P-EIIC  +  carbohydrate(out)  -  EIIC  +  carbohydrate-P(jn)      (6) 

In  most  cases,  EI  and  HPr  are  soluble,  cytoplasmic 
proteins  that  participate  in  the  phosphorylation  of  all  PTS 
carbohydrates  in  a  given  organism,  and  thus  are  referred  to  as 
general  PTS  proteins  (Postma  et  al.,  1993). 
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Enzyme  I 

EI  has  been  purified  from  a  variety  of  organisms  (Postma 
et  al.,  1993) .  The  monomer  molecular  weight  ranges  from  about 
64,000  in  E.  coli  and  S.  typhimurium  to  about  85,000  in 
Staphylococcus     aureus.  However,   evidence  suggests  that 

autophosphorylation  of  EI  (reaction  I)  requires  a  dimeric  form 
of  the  protein  (Kukuruzinska  et  al.,  1982;  Misset  et  al., 
1980;  Weigel  et  al.,  1982).  The  Els  from  most  organisms 
consist  of  identical  monomers  that  self-associate  to  form 
dimers.  The  exception  is  the  EI  from  Mycoplasma  capricolum, 
which  is  a  tetramer  composed  of  three  different  subunits 
(Meadow  et  al.,  1990).  Phosphorylation  occurs  at  the  N-3 
position  of  a  histidyl  residue  of  EI  during 
autophosphorylation  with  PEP  (Alpert  et  al.,  1985;  Weigel  et 
al.,  1982).  Studies  on  S.  typhimurium  EI  shows  that  this 
enzyme  requires  Mg2*,  Mn2*,  or  other  divalent  cations  for  its 
autophosphorylation  by  PEP  but  not  for  the  transfer  of  the 
phosphoryl  group  to  HPr  (Weigel  et  al.,  1982). 

The  kinetics  of  the  reaction  of  EI  have  been  well  studied 
and  the  data  have  extensively  been  reviewed  (Postma  et  al., 
1993;  Meadow  et  al.,  1990).  According  to  the  studies  on  Els 
from  enteric  bacteria,  EI  associates  at  room  temperature  to 
form  an  active  dimer  of  identical  subunits,  which  is 
phosphorylated  on  each  subunit  in  the  presence  of  Mg2*  and  PEP. 
The  P-EI  then  dissociates,  especially  at  lower  temperatures, 
to  the  monomers  which  then  phosphorylate  HPr.   Reassociation 
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of  dephosphorylated  subunits  is  apparently  the  rate-limiting 
step  for  this  whole  cycle.  This  enzyme  exhibits  a  ping-pong 
Bi  Bi  mechanism  and  catalyzes  the  isotope  exchange  reactions 
expected  for  an  enzyme  that  forms  a  covalent  intermediate. 
The  apparent  Km  values  range  from  0.2  to  0.6  mM  for  PEP  and  4 
to  30  MM  for  HPr. 

The  gene  encoding  EI  (ptsJ)  has  been  cloned  and  sequenced 
from  3  Gram  negative  organisms,  E.  coli  (Saffen  et  al.,  1987), 
S.  typhimurium  (LiCalsi  et  al.,  1991),  and  Alcaligenes 
eutrophus  (Pries  et  al.,  1991),  and  from  4  Gram  positive 
organisms.  Bacillus  subtilis  (Reizer  et  al.,  1993), 
Staphylococcus  carnosus  (Kohlbrecher  et  al.,  1992), 
Streptococcus  mutans  (Boyd  et  al.,  1994),  and  Streptococcus 
salivarius  (Gagnon  et  al.,  1992)  (Table  2).  In  all  cases, 
there  is  a  high  degree  of  sequence  similarity  around  and 
including  the  phosphorylated  histidine  residue  (Postma  et  al., 
1993)  .  The  deduced  amino  acid  sequences  of  Els  from  E.  coli 
and  S.  typhimurium  differ  in  only  16  residues,  consistent  with 
the  fact  that  these  proteins  are  functionally  interchangeable 
in  these  organisms  (Postma  et  al.,  1993).  However,  these  Els 
and  those  from  Gram  positive  bacteria  often  substitute  poorly 
for  one  another  in  PEP-dependent  phosphorylation  of 
heterologous  HPr  (Postma  et  al.,  1993).  Rhodobacter 
capsulatus  produces  a  multiphosphoryl  transfer  protein  which 
contains  the  functional  and  structural  equivalents  of  EIIA, 
HPr,  and  EI  in  a  single  polypeptide  chain  (Wu  et  al.,  1990). 
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In  addition,  two  other  enzymes,  pyruvate  phosphate  dikinase 
(Pocalyko  et  al.,  1990;  Wu  et  al.,  1990)  and  PEP  synthase 
(Niersbach  et  al.,  1992),  were  found  to  have  regions  of 
sequence  which  are  similar  to  EI.  These  included  the  region 
containing  the  phosphorylated  histidine  residue. 
Histidine-Containinq  Protein  (HPr) 

HPr,  the  second  general  protein  involved  in  the 
phosphoryl  transfer  reaction  sequence,  is  a  small  monomeric 
protein  with  a  molecular  weight  of  9,000  to  10,000.  Its  small 
size  and  heat  stability  have  aided  its  purification  from  a 
variety  of  organisms  (Postma  et  al.,  1993).  The  phosphoryl 
group  from  P-EI  is  transferred  to  the  N-l  position  of  a 
histidine  residue  (His15)  in  HPr.  All  HPrs  so  far  studied  from 
Gram  positive  bacteria  can  also  be  phosphorylated  by  an  ATP- 
dependent  HPr  kinase  at  a  serine  residue  (Ser46)  as  part  of  an 
important  regulatory  modification  (Meadow  et  al.,  1990;  Reizer 
et  al.,  1993).  This  regulatory  phosphorylation  also  occurs  in 
HPr  from  M.  capricolum,  an  unusual  bacterium  (Zhu  et  al., 
1993) .  The  region  around  Ser46  is  conserved  in  all  sequenced 
HPrs  of  Gram  positive  bacteria.  The  Ser46-phosphorylated  HPr 
is  not  able  to  transfer  its  phosphate  group  to  a  sugar,  and 
the  PEP-dependent  phosphorylation  at  His15  of  a  Ser46- 
phosphorylated  HPr  is  600-fold  slower  than  the  phosphorylation 
of  a  non-phosphorylated  HPr  (Reizer  et  al.,  1993).  In  B. 
subtilis,  this  regulatory  phosphorylation  is  involved  in 
catabolite  repression  (Deutscher  et  al.,  1994;  Reizer  et  al., 
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1993)  .  Ser46  is  also  present  in  HPrs  from  Gram  negative 
bacteria  but  is  not  phosphorylated,  and  the  corresponding  HPr 
kinase  is  absent  in  these  organisms.  The  region  surrounding 
Ser46  in  Gram  negative  organisms  shows  little  sequence  identity 
with  those  from  Gram  positive  organisms  (Meadow  et  al.,  1990; 
Reizer  et  al.,  1993). 

The  three-dimensional  structures  of  HPr  from  both  Gram 
negative  and  Gram  positive  bacteria  have  been  investigated  by 
several  different  methods:  two-dimensional  nuclear  magnetic 
resonance  (NMR)  and  three-dimensional  NMR  of  HPr  in  solution 
and  X-ray  crystallography  (Postma  et  al.,  1993).  The 
structure  of  the  E.  coli  HPr  determined  by  NMR  studies 
consists  of  four  8-strands  that  form  a  single,  pleated 
antiparallel  6-sheet  on  one  face  of  the  protein  and  three  a- 
helices  that  lie  in  a  plane  parallel  to  the  6-sheet. 
According  to  the  X-ray  structure  of  the  E.  coli  HPr,  however, 
the  four  6-strands  exist  in  two  pairs  removed  from  each  other 
and  a-helices  lie  in  different  planes  and  orientations, 
whereas  the  region  containing  the  phosphorylated  His  residue 
is  the  same  in  both  studies  (Postma  et  al.,  1993).  The  B. 
subtilis  HPr  has  also  been  investigated  by  X-ray 
crystallography  and  NMR.  The  overall  topology  of  three 
dimensional  structure  of  B.  subtilis  HPr  derived  from  NMR  and 
X-ray  crystallography  is  very  similar.  The  secondary- 
structure  folding  topology  of  this  protein  is  the  classical 
topology  which  has  been  described  as  an  open-face  6-sandwich. 
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It  is  formed  by  four  antiparallel  B-strands  with  three  apposed 
a-helices  packed  on  top  of  the  B-strands.  Both  His15  and  Ser46 
are  located  on  the  surface  of  the  protein  with  their  side 
chains  capping  the  N-termini  of  the  first  and  second  a- 
helices,  respectively  (Chen  et  al.,  1993). 

The  primary  amino  acid  sequences  of  HPr  from  11  bacteria 
have  been  determined  (Table  2)  .  Those  from  E.  coli  and  S. 
typhimurium  are  identical  (Byrne  et  al.,  1988),  and  only  one 
conservative  exchange  from  K.  pneumoniae  (Titgemeyer  et  al., 
1990) .  All  HPrs  share  a  striking  similarity  around  the  active 
site  His  residue,  and  all  have  this  active  site  His  residue  at 
position  15.  All  HPrs  also  have  an  Arg  residue  at  position  17 
(Meadow  et  al.,  1990).  HPrs  from  Gram  positive  bacteria, 
although  very  similar  to  each  other,  do  differ  considerably  in 
amino  acid  sequences  from  the  Gram  negative  bacterial  HPrs 
outside  of  the  region  containing  the  phosphorylated  His 
residue  (Postma  et  al.,  1993). 
Genetic  Organization  of  ptsH   and  ptsj 

The  arrangements  of  ptsH  and  ptsJ  on  genome  are  the  same 
in  most  of  bacteria  with  the  exception  of  M.  capricolum  (Fig 
1) .  In  most  bacterial  species  characterized  thus  far,  these 
two  genes  are  always  adjacent  to  each  other  and  form  an  operon 
with  ptsH  on  the  5'  end.  The  M.  capricolum  ptsH  gene, 
however,  is  part  of  a  monocistronic  operon  that  is  situated 
between  two  ORFs  unrelated  to  the  function  of  PTS  (Zhu  et  al., 
1993) .   The  genes  on  two  flanking  regions  of  ptsHI   operon  are 
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vary  in  different  organisms.  In  some  organisms,  these  regions 
contain  PTS  related  genes,  and  in  other  organisms,  genes 
unrelated  to  PTS  functions  are  present  in  these  regions.  For 
example,  the  upstream  region  of  B.  subtilis  ptsH  contains  the 
gene  encoding  glucose  permease  (ptsG)  (Zagorec  and  Postma, 
1992),  but  in  E.  coli  and  S.  typhimurium,  this  region  encodes 
O-acetylserine  (thiol) -lyase  A  (cysK) ,  an  enzyme  involved  in 
the  synthesis  of  L-cysteine  from  O-acetyl-L-serine  and  sulfide 
(Byrne  et  al.,  1988). 
Enzyme  II  Complexes 

The  sugar-specific  PTS  permeases  (EII)  are  the  most 
diverse  PTS  proteins  (Meadow  et  al.,  1990).  They  may  consist 
of  one,  two,  three,  or  four  distinct  polypeptide  chains  (Saier 
and  Reizer,  1992).  EII  proteins  found  within  a  single  species 
often  show  little  seguence  similarity  (Meadow  et  al.,  1990). 
However,  regardless  of  the  number  of  polypeptide  chains  and 
the  orders  of  these  chains,  most  Ells  share  several  conserved 
features.  First,  they  generally  have  similar  molecular 
weights  of  about  68,000,  corresponding  to  about  635  total 
amino  acyl  residues  (Saier  et  al.,  1988).  Second,  there  are 
always  at  least  three  well-recognized  functional  domains:  a 
hydrophobic  transmembrane  domain  which  binds  and  transports 
the  sugar  substrate  (EIIC) ,  a  hydrophilic  enzyme  Ill-like 
domain  which  possesses  the  first  phosphorylation  site  (always 
a  histidyl  residue)  (EIIA) ,  and  a  second  hydrophilic  protein 
or  protein  domain  which  possesses  the  second  phosphorylation 
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site  (either  a  cystidyl  residue  or  a  histidyl  residue)  (EIIB) 
(Fig.  2)  (Saier  and  Reizer,  1992)  .  The  activity  of  EII  may  be 
controlled  by  HPr  Ser46  phosphorylation  via  the  ATP-dependent 
kinase  in  Gram  positive  bacteria  (Ye  et  al.,  1994b). 
Regulation  of  PTS  Protein  Synthesis 

The  synthesis  of  PTS  proteins  is  highly  regulated.  In 
enteric  bacteria,  expression  of  the  ptsH,  ptsl  and  err 
increases  about  threefold  during  growth  on  PTS  sugars, 
especially  glucose,  when  compared  to  growth  on  lactate  (Mattoo 
and  Waygood,  1983;  Stock  et  al.,  1982).  Anaerobic  growth 
conditions  also  favor  higher  expression  (Lengeler  and 
Steinberger,  1978) .  Transcription  of  the  ptsHI  operon  is  also 
enhanced  by  unphosphorylated  EIICBGlc  (De  Reuse  and  Danchin, 
1988;  De  Reuse  and  Danchin,  1991).  These  three  genes  (ptsH, 
ptsl,  err)  constitute  an  operon,  but  three  major  mRNA  species 
have  been  found,  composed  of  those  including  entire  three 
genes,  ptsH  only,  and  err  only,  respectively  (De  Reuse  and 
Danchin,  1988).  In  B.  subtilis,  the  expression  of  sucrose- 
specific  EII  of  PTS  is  induced  by  sucrose  (Fouet  et  al., 
1989)  . 

The  expression  of  ptsHI  operon  of  E.  coli  is  under  very 
complex  regulation.  It  is  controlled  by  two  contradictory 
regulatory  mechanisms,  the  glucose-mediated  and  the  CAP-cAMP- 
mediated  activations  (De  Reuse  et  al.,  1992;  Ryu  and  Garges, 
1994)  which  may  involve  three,  possibly  five,  promoters 
(Postma  et  al.,  1993;  Ryu  and  Garges,  1994).   Three  of  these 


Figure  2 .  Schematic  depiction  of  representative  PTS 
permeases  and  their  energy-coupling  proteins  fround  in 
enteric  bacteria  (A  to  D  and  F) ,  B.  subtilis  (E) ,  and  R. 
capsulatus    (G)  (Saier  and  Reizer,  1993). 
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promoters  (PO,  PI,  and  Px)  are  located  upstream  from  ptsH, 
separated  by  about  100  bp.  A  fourth  (P2) ,  the  major  promoter 
for  err,  is  located  within  the  3' -terminal  end  of  ptsl.  A 
fifth  transcription  start  site,  possibly  involved  in 
antitermination,  is  located  downstream  from  ptsH  and  is 
directed  against  PO  and  PI.  In  the  PO-Px-Pl  promoter  region, 
there  are  two  sequences  showing  similarity  with  the  consensus 
of  CAP-binding  sites,  one  of  which,  CAPa,  is  located  near  PO, 
and  the  other,  CAPb,  is  located  in  the  -35  region  of  PO.  PO 
is  the  major  promoter  in  the  presence  of  the  CAP-cAMP  complex 
and  PI  is  the  major  promoter  in  its  absence.  In  the  absence 
of  CAP,  PO  is  occluded.  In  the  presence  of  CAP,  binding  of 
CAP  to  the  PO  site  is  increased  by  PI.  P2  is  not  activated  by 
the  CAP-cAMP  complex.  Most  recently,  Ryu  and  Garges  (1994) 
showed  that  PO  and  PI  exhibit  a  switching  mechanism  in  vitro, 
where,  depending  upon  the  presence  or  absence  of  CAP-cAMP, 
transcription  is  initiated  from  different  start  sites  (POa  and 
POb,  Pla  and  Plb) .  In  vivo  POa  is  the  major  start  site  for 
PO,  and  POb,  a  site  3  bases  upstream  from  POa,  is  never 
detected,  although  POb  is  active  in  vitro  when  linear  DNA  is 
used  and  CAP-cAMP  is  present.  There  is  no  PO  expression  when 
CAP  or  cAMP  are  absent  in  the  cell  unless  glucose  is  present. 
In  vivo,  most  transcription  from  PI  is  from  Pla  unless  glucose 
is  present.  Glucose  causes  a  switch  to  Plb,  a  site  7  bases 
downstream  from  Pla.  But  in  both  events,  the  overall 
expression  of  Pla  and  Plb  do  not  change  much,  meaning  that 
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there  is  very  little  CAP-cAMP  activation  of  this  promoter  in 
vivo.  Apparently,  some  barrier  to  Plb  initiation  can  be 
relieved  by  the  addition  of  glucose.  Px  is  recognized  by  cr32 
RNA  polymerase,  which  is  absent  unless  CAP-cAMP  or  glucose  are 
present.  All  of  these  results  indicate  that  the  two  different 
regulatory  mechanisms  (one  through  CAP-cAMP,  the  other  through 
glucose)  are  working  together  for  fine  control  of  ptsHI 
operon. 

Transcription  antitermination  is  another  way  to  regulate 
the  expression  of  PTS  proteins.   One  such  example  is  the 
expression  of  bgl   operon  in  decryptified  E.   coli   strains.   A 
model  on  this  antitermination  mechanism  has  been  proposed 
(Postma  et  al.,   1993).    This  operon  is  preceded  by  a 
constitutive  promoter  and  consists  of  three  genes,  bglG,  bglF , 
and  bglB.        These  genes  encode  antiterminator,  EIIB9t,  and 
phospho-6-glucosidase,  respectively.  The  first  gene,  bglG,   is 
flanked  on  both  sides  by  two  terminators  tl  and  t2.   The 
transcription  of  the  operon  is  terminated  by  tl  and  t2  in  the 
absence  of  inducer,  B-glucosides.   The  antiterminator,  BglG, 
can  be  phosphorylated  in  a  reversible  reaction  by  P-EIIB9'. 
The  phosphorylated  BglG  is  inactive.   in  the  presence  of  B- 
glucosides,  the  uptake  of  B-glucosides  causes  the  phosphoryl 
group  on  P-EIIB<>1  to  preferentially  transfer  to  the  substrate. 
This  drain  causes  the  dephosphorylation  of  P-BglG  and 
subsequently  become  an  active  antiterminator.  The  active  BglG 
binds  to  specific  RNA  sequences  located  upstream  of  tl  and  t2, 
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prevents  termination,  and  makes  the  transcription  of  bgl 
operon  possible.  In  B.  subtilis,  expression  of  sacPA  operon 
which  encodes  a  EIIBCscr  {sacP)  and  a  sucrase  (sacA)  is  also 
controlled  by  a  proposed  antiterminator,  SacT.  This 
antiterminator  is  homologous  to  the  E.  coli  BglG,  but  requires 
EI  and  HPr  for  activity  (Arnaud  et  al.,  1992). 

Many  other  mechanisms  exist  to  regulate  the  expression  of 
PTS  proteins  which  include  regulation  through  classical 
regulator-operator  pairs,  such  as  the  expression  of  nag   genes 
from  E.    coli   and  K.   pneumoniae    (Postma  et  al.,  1993). 
Metabolic  Regulation  of  Non-PTS  Sugars  by  PTS 

In  addition  to  the  primary  function  in  sugar  transport, 
the  PTS  has  an  important  role  in  the  regulation  of  metabolism 
of  non-PTS  sugars.  It  has  been  observed  in  enteric  bacteria 
that  mutants  defective  in  EI  and  HPr  are  not  only  unable  to 
utilize  PTS  carbohydrates  as  the  sole  source  of  carbon  for 
growth,  but  also  unable  to  grow  on  a  number  of  non-PTS  sugars, 
including  lactose,  maltose,  melibiose,  glycerol,  rhamnose, 
xylose,  and  Krebs  cycle  intermediates.  This  defect  can  be 
relieved  by  the  addition  of  cAMP  or  by  the  introduction  of  a 
err  mutation  (Postma  et  al.,  1993).  A  model  for  this  type  of 
PTS-mediated  regulation  in  enteric  bacteria  has  been  proposed 
(Postma  et  al.,  1993).  The  central  regulatory  molecule  is 
EHAGlc,  the  product  of  err.  EIIAGlc  can  exist  in  two  states: 
phosphorylated  EIIAGlc  (P-EIIAGlc)  and  nonphosphorylated  EIIAGlc. 
The  phosphorylation  state  of  EIIAGlc  is  determined  by  the 
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balance  between  phosphorylation  via  P-HPr  and 
dephosphorylation  via  EIICBGlc  in  the  presence  of  its 
substrate.  EIIGlc  and  P-EIIGlc  interact  with  and  regulate 
different  proteins.  EIIGlc  binds  to  and  inhibits  several 
enzymes  essential  in  carbohydrate  metabolism,  for  example,  the 
lactose  and  melibiose  carriers.  But  the  binding  of  EIIGlc 
occurs  only  when  a  substrate  of  the  target  protein  is  present. 
The  direct  result  of  binding  is  inhibition  of  uptake  and  of 
subsequent  metabolism  of  these  carbon  sources.  P-EIIAGlc  is  an 
activator  of  adenylate  cyclase,  the  enzyme  for  the  synthesis 
of  CAMP.  cAMP  plays  a  central  role  in  gene  expression  in 
enteric  bacteria.  Together  with  CRP,  cAMP  is  involved  in  the 
(generally)  positive  regulation  of  a  number  of  catabolic 
genes.  The  inhibition  of  transport  and  metabolism  of  non-PTS 
carbohydrates  can  be  brought  about  by  any  PTS  carbohydrate 
since  all  PTS  carbohydrates  can  dephosphorylate  P-EIIAGlc, 
either  directly  via  EIICBGlc  (if  the  PTS  carbohydrate  is 
glucose)  or  indirectly  (transport  and  phosphorylation  of  all 
other  PTS  sugars  via  their  respective  EII  complexes  result  in 
dephosphorylation  of  P-HPr) .  Since  the  phosphorylation  of 
EIIAGlc  by  P-HPr  is  a  reversible  process,  the  dephosphorylation 
of  P-HPr  by  other  EII  complexes  will  dephosphorylate  P-EIIAGlc. 
Recently,  van  der  Vlag  et  al.  (1994)  demonstrated  a 
quantitative  relationship  between  the  amount  of  EIIAGlc  and  the 
regulation  of  glycerol  and  maltose  metabolism  in  S. 
typhimurium.     The  inhibition  of  glycerol  and  maltose  uptake  by 
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the  addition  of  a  PTS  sugar  was  sigmoidally  related  to  the 
amount  of  EIIAGlc.  For  complete  inhibition  of  glycerol  uptake, 
a  minimal  ratio  of  about  3.6  mol  of  EIIAGlc  to  1  mol  of 
glycerol  kinase  was  required  while  a  ratio  of  about  18  mol  of 
EIIAGlc  to  1  mol  of  MalK  protein  (part  of  maltose  transport 
system  and  the  target  of  EIIAGlc)  was  required  for  complete 
inhibition  of  maltose  uptake.  Varying  the  level  of  EIIAGlc  did 
not  affect  the  growth  rate  on  glycerol,  the  rate  of  glycerol 
uptake,  or  the  synthesis  of  glycerol  kinase.  In  contrast, 
the  growth  rate  on  maltose,  the  rate  of  maltose  uptake,  and 
the  synthesis  of  maltose-binding  protein  increased  two-  to 
fivefold  with  increasing  levels  of  EIIAGlc.  The  synthesis  of 
MalK  protein  was  not  affected  by  varying  the  levels  of  EIIAGtc. 
Many  Gram  positive  organisms  also  show  a  distinct 
"glucose  effect"  (Meadow  et  al.,  1990).  Several  mechanisms 
exist  whereby  expression  of  catabolic  genes  can  be  regulated 
in  Gram  positive  bacteria.  In  B.  subtilis,  catabolite- 
repressible  genes  are  controlled  by  multiple  mechanisms  rather 
than  by  a  single  global  regulatory  system,  and  these 
mechanisms  are  known  to  differ  from  those  operating  in  enteric 
bacteria  (Stewart,  1993).  Deutscher  et  al.  (1994)  has 
demonstrated  that  phosphorylation  of  HPr  at  Ser46  by  the  ATP- 
dependent  kinase  is  directly  or  indirectly  involved  in 
catabolite  repression  in  B.  subtilis.  After  Ser44  in  HPr  of 
a  B.  subtlis  strain  was  replaced  with  a  nonphosphorylatable 
alanyl  residue,  the  catabolite  repressive  effects  of  glucose 
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on  gluconate  kinase,  glucitol  dehydrogenase,  and  the  mannitol- 
specific  catabolic  enzymes  were  abolished,  but  this  mutation 
had  no  effect  on  the  uptake  either  of  the  PTS  substrates, 
glucose  and  mannitol,  or  of  the  non-PTS  substrates,  gluconate 
and  glucitol.  A  comparable  result  was  obtained  in  S. 
salivarius  (Vadeboncoeur  et  al.,  1994).  The  replacement  of 
Met48  in  HPr  with  a  valine  resulted  in  the  inhibition  of 
phosphorylation  of  HPr  on  Ser46  by  the  ATP-dependent  kinase  but 
did  not  prevent  phosphorylation  of  HPr  by  enzyme  I  or  the 
phosphorylation  of  EII  complexes  by  HPr  (His-P) .  However,  the 
mutant  had  pleiotropic  properties,  including  reduced  growth  on 
non-PTS  sugars.  In  Lactobacillus  brevis ,  glucose  and  lactose 
are  transported  via  a  proton  symport.  The  activities  of 
glucose  and  lactose  permeases  in  this  organism  are  also 
regulated  by  the  phosphorylation  of  HPr  on  Ser46  (Ye  et  al., 
1994a) . 
Other  Functions  of  PTS 

PTS  also  has  many  other  functions,  including  regulation 
of  gluconeogenesis  and  other  processes,  interaction  with 
acetate  kinase,  nitrogen  regulation  (Postma  et  al.,  1993). 
The  PTS  is  also  a  signal  transduction  system  through  which 
bacteria  respond  by  positive  chemotaxis  to  the  presence  of  PTS 
carbohydrates.  in  PTS-dependent  chemotaxis,  stimulation 
corresponds  to  uptake  and  phosphorylation  of  a  substrate 
through  an  EII.  No  methyl-accepting  chemotaxis  proteins 
(CheA,  CheY,  CheW,  and  CheZ)  are  involved  in  this  process, 
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since  mutants  lacking  the  methyl-accepting  chemotaxis  proteins 
retained  normal  PTS-dependent  chemotaxis.  Cells  lacking  a 
specific  EII  do  not  show  chemotaxis  toward  the  corresponding 
substrate.  In  contrast,  mutants  lacking  EI  and  HPr  do  not 
show  a  chemotaxis  response  to  any  PTS  carbohydrate,  even  when 
the  corresponding  Ells  are  present. 


CHAPTER  3 
ISOLATION  OF  THERMOPHILIC  CELLULOLYTIC  BACILLUS   STRAINS 


Most  thermophilic  bacteria  are  generally  isolated  from 
hot  springs,  solfatatas  or  geothermally  heated  soils.  But 
thermophilic  species  of  Bacillus  may  be  isolated  from  a  wide 
range  of  both  thermophilic  and  mesophilic  environments.  They 
comprise  the  first  group  of  thermophilic  microorganisms  to  be 
described  and  characterized  in  detail.  The  first  published 
reference  is  attributed  to  Miquel  in  1918  who  described  an 
organism  growing  at  70°C  and  that  had  been  isolated  from  the 
river  Seine  in  Paris  (Sharp  et  al.,  1992). 

The  genus  Bacillus  is  one  of  the  most  diverse  and 
commercially  useful  groups  of  microorganisms.  Our 
understanding  of  the  genetics  and  physiology  of  one  of 
representatives,  B.  subtilis,  is  second  only  to  that  of  E. 
coli  (Harwood,  1989) .  Representatives  of  this  genus  are 
widely  distributed  in  soils,  air,  and  water  where  they  are 
involved  in  a  range  of  chemical  transformations.  The 
metabolic  diversity  and  low  reported  incidence  of 
pathogenicity  have  led  to  representatives  of  this  group  of 
organisms  being  used  in  a  wide  range  of  industrial  processes. 
Strains  of  Bacillus   are  used  for  the  production  of  four  main 
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types  of  products:  (1)  enzymes,  (2)  antibiotics,  (3)  fine 
biochemicals  including  flavor  enhancers  and  food  supplements, 
and  (4)  insecticides.  It  is  the  enzyme  market  that  currently 
accounts  for  the  main  commercial  importance  of  members  of  this 
genus.  In  particular  they  are  the  major  source  of  hydrolyzing 
enzymes  for  the  food  and  detergent  markets. 

Complete  biodegradation  of  crystalline  cellulose  requires 
the  concerted  action  of  at  least  three  types  of  enzymes  which 
include  cellobiase  (see  Review  of  Literature)  .  Various 
methods  have  been  developed  for  the  screening  of 
microorganisms  which  are  able  to  degrade  crystalline  cellulose 
(Kluepfel,  1988). 

The  objective  of  this  study  was  to  isolate  and  identify 
thermophilic  Bacillus      strains   which   have   cellulolytic 
activities  and  other  hydrolase  activities. 
Materials  and  Methods 

Sources  of  Samples 

102  natural  samples  were  screened  for  the  isolation  of 
thermophilic  cellulolytic  Bacillus  strains.  These  samples 
included  animal  manures,  composts,  foods,  pond  water,  sea 
water,  soils,  etc. 

Isolation  of  Thermophilic  Bacillus     Strains  with  Putative 
Cellulolytic  Activity 

Approximately,  5  grams  of  environmental  samples  were 

combined  with  5  ml  of  nutrient  broth  (3  g  beef  extract,  5  g 

peptone  per  liter)  and  mixed  vigorously  by  votexing  for 
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approximately  5  minutes.  After  a  brief  period  to  allow  the 
settling  of  large  particles,  1  ml  of  this  suspension  was 
combined  with  1  ml  of  100%  ethanol  and  shaken  gently  for  1 
hour  at  room  temperature  (22°C)  to  enrich  for  endospores  by 
inactivating  vegetative  cells  (Koransky  et  al.,  1978). 
Dilutions  of  treated  samples  were  spread  on  nutrient  agar 
plates  and  incubated  overnight  at  55°C  or  65°C.  Single 
colonies  were  transferred  to  master  plates  (nutrient  agar)  and 
to  nutrient  agar  plates  containing  0.25%  powdered  cellulose 
(Solka-Floc  SW40;  James  River  Corp.,  Berlin,  N.H.).  Putative 
cellulolytic  isolates  were  identified  by  the  presence  of  a 
clear  zone  or  a  partially  cleared  halo  after  4  to  7  days  of 
incubation  at  the  temperatures  from  which  they  were  isolated. 
Testing  for  Hydrolases  and  Taxonomic  Traits 

The  presence  of  complete  cellulase  activity  was  tested  by 
using  nutrient  agar  plates  containing  0.25%  cellulose  as 
described  above.  Endoglucanase  activity  was  tested  by  the 
Congo  red  method  with  nutrient  agar  plates  containing  0.2% 
carboxymethyl  cellulose  (CMC)  (Kluepfel,  1988)  .  Putative 
cellobiohydrolase  and  6-glucosidase  activities  were  tested  on 
nutrient  agar  plates  containing  10  mg  of  4-methylumbellif eryl- 
6-D-cellobiopyranoside  (MUC)  or  4-methylumbelliferyl-B-D 
glucopyranoside  (MUG)  per  liter  (Wood  and  Bhat,  1988) . 
Mannosidase,  arabinosidase,  and  xylosidase  activities  were 
screened  in  a  similar  manner  with  the  respective  umbelliferyl 
derivatives.   The  presence  of  pectin-degrading  enzymes  was 
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tested  by  a  modification  of  the  method  described  by  Brown  et 
al.  (1991)  in  which  sodium  polypectate  was  used  instead  of 
alginate.  Taxonomic  tests  were  performed  as  described  in 
Bergey's  manual  (Claus  and  Berkeley,  1986). 

Results 

Thermophiles  were  abundant  in  all  samples  screened  (2,000 
to  1,500,000  CFUs  per  gram  sample).  All  strains  which  were 
highly  cellulolytic  (clear  zones  on  cellulose  agar)  were 
provisionally  identified  as  actinomycete  based  on  cell 
morphology  and  growth  habit.  However,  many  other  isolates 
were  surrounded  by  a  partially  cleared  halo  and  were  presumed 
to  contain  incomplete  cellulase  digestion  systems. 

A  total  of  168  representative  clones  which  produced 
partially  cleared  halos  on  cellulose  plates  were  selected  for 
further  study  (112  were  from  the  55°C  selection  and  56  were 
from  65°C)  .  All  strains  were  aerobic,  endospore-forming,  Gram 
positive  rods.  Among  the  112  strains  from  the  55°C  selection, 
only  28  could  grow  at  65°C.  In  contrast,  strains  selected  at 
65°C  also  grew  at  55°c  and  75°C.  More  detailed  taxonomic 
studies  were  conducted  with  2  representative  strains  from  the 
55°C  selection  and  5  from  the  65°C  selection  (Table  3). 

Closely  related  organisms  may  have  very  similar  proteins. 
In  a  separate  project  (Lai  and  Ingram  unpublished  data) ,  a 
plasmid  containing  the  genomic  DNA  of  strain  XL-65-6  was 
cloned  and  partially  sequenced.  A  search  of  GenBank  data  base 
revealed  that  the  deduced  amino  acid  sequence  of  this  insert 
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was  highly  homologous  with  the  carboxyl  terminal  end  of 
alanine  racemase  from  B.  stearothermophilus,  B.  subtilis ,  E. 
coli,  and  S.  typhimurium  (Table  4)  .  It  is  interesting  to  note 
that  the  deduced  amino  acid  seguence  of  this  insert  shares 
91.4%  identity  and  96.2%  similarity  with  the  B. 
stearothermophilus  enzyme,  but  shares  relatively  low  homology 
with  the  enzymes  from  other  organisms. 

These  organisms  represent  a  rich  source  of  enzymes  for 
the  depolymerization  of  carbohydrates  (Table  5  and  Table  6) . 
All  but  21  of  the  168  isolates  exhibited  activity  on  one  or 
more  cellulolytic  substrates  (MUC,  MUG,  CMC) ,  further 
validating  the  selection  procedure.  No  CMCase  activity  was 
found  in  the  isolates  selected  at  65°C  or  those  selected  at 
55°C  but  able  to  grow  at  65°C  when  tested  at  65°C.  More  than 
half  of  the  isolates  which  were  selected  at  55°C  and  unable  to 
grow  at  65°C  exhibited  endoglucanase  activity.  About  one- 
third  of  the  65°C-growing  isolates  did,  however,  display  weak 
CMCase  activity  when  incubated  at  55°C.  All  isolates 
contained  multiple  activities  associated  with  the  hydrolysis 
of  cellulose  and  hemicellulose  as  well  as  other  polymers,  as 
illustrated  by  the  representative  strains  (Table  5  and  Table 
6). 

The  profile  of  types  of  cellulase  and  hemicellulase 
activities  in  65°C-  and  55°C-growing  strains  are  fairly 
different.   Comparing  the  cellulase  activities,  MUG  activity 
is  the  most  abundant  activity  in  65°C-growing  strains.   The 
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percentage  of  strains  with  this  activity  (70.2%)  is 
predominantly  higher  than  those  with  CMC  or  MUC  activities 

(28.6%  and  48.8%,  respectively).  Only  8.3%  of  the  strains 
exhibited  all  three  cellulase  activities.  In  55°C-growing 
strains,  all  three  cellulase  activities  are  fairly  common. 
Approximately  80%  of  the  strains  exhibited  MUC  activity,  76% 
exhibited  MUG  activity,  and  69%  exhibited  CMC  activity.  Over 
half  of  these  strains  contained  all  of  the  three  activities. 
For   the   three   hemicellulase   activities   tested 

(arabinosidase,  mannosidase,  and  xylosidase)  ,  these  two  groups 
of  bacteria  also  exhibited  different  profiles.  In  65°C- 
growing  strains,  MUM  activity  was  the  most  common  activity 

(77%)  with  MUA  and  MUX  activities  far  less  common  (23%  and 
32%,  respectively).  In  contrast,  MUM  activity  was  the  least 
common  activity  in  55°C-growing  stains.  Only  19%  of  the 
strains  had  this  activity,  while  more  than  half  of  the  strains 
had  MUA  and  MUX  activities  (61%  and  52%,  respectively). 
Strains  which  exhibited  all  three  hemicellulase  activities 
were  rare  in  both  groups  of  strains,  with  14%  for  65°C-growing 
strains  and  9%  for  55°C-growing  strains. 

Other  hydrolytic  activities  are  also  common  in  all  the 
isolates.  Approximately  69%  of  the  65°C-growing  strains 
exhibited  amylase  activity  and  58%  of  the  55°C-growing  strains 
had  caseinase  activity. 

Strain  XL-65-6,  which  was  isolated  from  rotting  wood  by 
Lake  Alice  (Gainesville,  Florida),  was  selected  for  further 
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study  on  cellobiose  utilization  at  a  molecular  level  since  it 
had  high  activities  on  both  MUC  and  MUG,  as  well  as  on  MUA  and 
MUM  (Table  5) . 

Discussion 

The  objective  of  this  study  was  to  isolate  thermophilic 
cellulolytic  strains  of  Bacillus.  The  isolation  method  used, 
combining  ethanol  pre-treatment,  high  incubation  temperature, 
and  nutrient  agar  medium,  appeared  to  be  fairly  selective  for 
thermophilic  Bacillus  species  from  various  habitats.  All 
colonies  isolated  were  either  Bacillus  species  or 
actinomycetes.  Bacillus  species  constituted  the  majority  of 
these  isolates.  Since  most  of  actinomycete  also  form  spores 
and  many  are  thermophiles  (Brock  and  Madigan,  1991) ,  it  was 
not  surprising  that  members  of  this  group  were  also  recovered 
with  Bacillus  species  although  the  number  of  actinomycete  was 
much  lower. 

Representative  strains  were  classified  according  to 
Bergey's  Manual  (Claus  and  Berkeley,  1986).  All  strains  had 
the  distinctive  characteristics  of  Bacillus  species  -  aerobic, 
endospore-forming,  Gram  positive  rods. 

The  Bacillus  species  which  can  grow  at  65°C  or  higher 
include  B.  acidocaldarius ,  B.  schlegelii,  and  B. 
stearothermophilus .  But  B.  acidocaldarius  can  grow  only  under 
acidic  condition  (below  pH  6)  while  the  65°C  isolates  from 
this  study  grew  well  at  pH  6.8  (Table  3).  B.  schlegelii  does 
not  produce  acid  from  glucose  and  xylose,  and  lacks  the 
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ability  to  hydrolyze  gelatin  and  starch.  Most  of  the  65°C 
isolates  produced  acid  by  fermenting  these  two  sugars  and 
hydrolyzed  gelatin  and  starch.  In  contrast,  most  of  the 
traits  of  the  isolates  fit  those  of  B.  stearothermophilus . 
Furthermore,  one  65°C  isolate,  XL-65-6,  had  very  similar 
alanine  racemase  to  that  from  B.  stearothermophilus.  Thus, 
these  65°C  isolates  were  identified  as  B.   stearothermophilus. 

Among  the  Bacillus  species,  only  B.  coagulans  and  B. 
licheniformis  strains  are  capable  of  growing  at  temperatures 
ranging  from  30°C  to  55°C.  B.  licheniformis  are  capable  of 
growing  at  high  NaCl  concentration  (up  to  7%)  ,  while  B. 
coagulans  cannot  grow  at  5%  of  NaCl.  Both  strain  XL-50-60  and 
XL-50-84  grew  at  30°C  and  55°C,  but  not  at  65°C,  and  neither 
grow  in  the  presence  of  2%  NaCl.  Thus,  the  strains  which  were 
isolated  at  55°C  and  unable  to  grow  at  65°C  are  most  similar 
to  B.  coagulans.  It  is  likely  that  the  strains  which  were 
isolated  at  55°C  and  able  to  grow  at  65°C  were  strains  of  B. 
stearothermophilus . 

Many  isolates  did  not  exhibit  all  properties  listed  in 
Bergey's  Manual.  For  example,  11%  -  89%  of  B. 
stearothermophilus  strains  can  grow  at  5%  NaCl.  however,  all 
of  our  isolates  failed  to  grow  in  2%  NaCl.  As  mentioned  in 
the  Literature  Review  section,  both  B.  coagulans  and  B. 
stearothermophilus  are  highly  heterogeneous  species.  The 
positive  traits  in  Bergey's  Manual  indicate  only  that  more 
than  90%  of  strains  are  positive.   The  occurrence  of  some 
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traits  which  are  different  from  the  standard  strains  in 
Bergey's  Manual  is  thus  not  surprising. 

The  activities  on  MUC  and  MUG,  as  well  as  those  of  MUA 
and  MUM  appear  to  be  linked.  Among  B.  stearothermophilus 
strains,  45.2%  had  both  MUC  and  MUG  activities.  This 
percentage  is  very  close  to  that  showing  MUC  activity  alone 
(48.8%).  The  percentage  of  strains  with  CMC  and  MUC  or  MUG 
activity  was  different  from  those  with  only  one  activity.  The 
percentage  of  B.  coagulans  strains  expressing  both  MUC  and  MUG 
activities  was  75.0%,  similar  to  MUG  activity  alone  (76.2%). 
Those  containing  CMC  activity  were  also  quite  different. 
There  are  at  least  three  possible  explanations  for  these 
results.  First,  genetic  linkage,  where  in  B. 
stearothermophilus  and  B.  coagulans  MUG  and  MUC  activity  are 
linked.  Second,  ambiguity  of  the  substrates,  MUC  and  MUG. 
The  chromogenic  substrate  MUC  has  often  been  used  to  measure 
cellobiohydrolase  activity,  but  potential  confounding 
activities  from  6-glucosidases  are  well  known  (Coughlan, 
1988).  Thus,  both  substrates  may  be  cleaved  by  a  single 
enzyme.  Third,  multiple  activities,  where  one  enzyme  truly 
has  both  exoglucanase  and  6-glucosidase  activity.  A 
comparable  situation  may  occur  for  MUA  and  MUM  activity. 


CHAPTER  4 

CLONING  AND  SEQUENCING  OF  CELLOBIOSE-SPECIFIC  PERMEASE  OF 

PHOSPHOENOLPYRUVATE-DEPENDENT  PHOSPHOTRANSFERASE 

SYSTEM  FROM  BACILLUS   STEAROTHERMOPHILUS 


Despite  the  potential  abundance  of  cellobiose  in  the 
environment,  comparatively  little  is  known  about  the  uptake 
systems  which  initiate  metabolism  in  bacteria.  The  best 
characterized  eel  operon  is  cryptic  and  must  be  activated  in 
laboratory  strains  of  E.  coli  (Parker  and  Hall,  1990). 
Functional  genes  for  cellobiose  utilization  have  been  reported 
in  some  recent  isolates  of  E.  coli  from  human  and  animal 
manures  (Hall  and  Faunce,  1987) .  Cellobiose  is  also 
metabolized  by  other  enteric  bacteria  such  as  Klebsiella  sp. 
(Al-Zaag,  1989)  and  Erwinia  sp.  (El  Hassouni  et  al.,  1992). 
Considering  the  similarities  between  these  enteric  genera,  it 
is  likely  that  all  use  a  cellobiose  PTS  to  initiate 
metabolism.  Many  Gram  positive  and  Gram  negative  bacteria 
have  been  reported  to  contain  cell-associated  B-glucosidases 
(Coughlan  and  Mayer,  1992) .  Gram  positive  bacteria  such  as 
Cellulomonas  uda  and  C.  favigena  contain  cellobiose 
phosphorylases  which  are  presumed  to  be  involved  in 
intracellular  cellobiose  metabolism  (Coughlan  and  Mayer, 
1992)  . 
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The  objective  of  this  study  was  the  cloning  and 
sequencing  of  Bacillus  stearothermophilus  genes  which  confer 
the  ability  to  hydrolyze  the  cellobiose  model  substrates,  4- 
methylumbelliferyl-B-D-cellobiopyranoside  (MUC)  and  4- 
methylumbelliferyl-B-D-glucopyranoside  (MUG) . 
Materials  and  Methods 

Bacterial  Strains.  Plasmids.  and  Growth  Conditions 

Bacterial  strains  and  plasmids  used  in  this  study  are 
listed  in  Table  7.  Strains  of  B.  stearothermophilus  were 
grown  in  Difco  Nutrient  Broth  (or  agar)  at  65°C.  Strains  of 
E.  coli  were  grown  at  37°C  in  Luria  broth  (or  agar)  (Atlas  and 
Parks,  1993).  Ampicillin  (50  /xg/ml)  and  tetracycline  (10 
jtjg/ml)  were  added  to  media  after  sterilization  as  appropriate 
for  selection  of  recombinant  E.  coli. 
DNA  Manipulation 

Procedures  used  for  the  preparation  of  E.  coli  plasmids, 
assembly  of  recombinant  DNA,  and  the  transformation  of  E.  coli 
were  as  described  previously  (Sambrook  et  al.,  1989). 
Digestions  with  restriction  enzymes  were  carried  out  as 
recommended  by  the  manufacturers.  Genomic  DNA  was  isolated 
from  B.  stearothermophilus  strain  XL-65-6  by  the  standard 
method  (Cutting  and  Vander  Horn,  1990) . 
Cloning  of  B.  stearothermophilus  eel   Operon 

Initial  testing  indicated  that  DNA  isolated  from  XL-65-6 
was  not  efficiently  digested  by  restriction  enzyme  Sau3AI  but 
was  digested  into  small  fragments  by  Mbol.     Poor  digestion  by 
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Sau3AI  is  presumed  to  result  from  the  presence  of  5- 
methylcytosine  which  is  tolerated  by  Mbol  (Roberts  and 
Macelis,  1992)  .  Fragments  of  4-6  kilobase  pairs  (kbp)  were 
isolated  from  partial  Mbol  digestions  of  chromosomal  DNA  by 
agarose  gel  electrophoresis.  These  were  ligated  into  the 
BamHI  site  of  pUC18  and  transformed  into  E.  coli  strain  DH5a. 
Plasmid  DNA  was  isolated  from  the  pooled  colonies 
(approximately  3000  clones)  and  served  as  an  amplified 
library.  Secondary  transformants  of  DH5a  were  screened  for 
MUC  and  MUG  activity.  Clones  containing  both  activities  were 
subsequently  found  to  contain  the  eel  operon. 
Southern  hybridization 

Samples  of  genomic  DNA  from  B.  stearothermophilus  XL-65-6 
were  digested  with  five  restriction  enzymes  (BssHII,  EcoRl, 
Hindlll,  PstI,  and  Styl) ,  respectively.  After  agarose  gel 
electrophoresis,  fragments  were  transferred  to  Zeta-Probe  GT 
membranes  (Bio-Rad  Laboratories,  Richmond,  CA)  and  probed  with 
a  1.1  kbp  Styl  fragment  from  within  the  B.  stearothermophilus 
eel  operon.  This  probe  was  labeled  with  digitoxigenin  (Genius 
System,  Boehringer  Mannheim  Biochemicals,  Indianapolis,  IN) 
and  used  as  recommended  by  the  manufacturer.  Genomic  DNA  from 
E.  coli  DH5a  was  digested  with  PstI  and  served  as  a  control. 
DNA  sequencing  and  sequence  analysis 

The  5  kbp  fragment  in  pLOI902  was  subcloned  into 
pBluescript  II  KS"  and  pBluescript  SK"  prior  to  sequencing. 
Two  series  of  nested  deletions  were  generated  by  using  the 
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Erase-a-Base  System  (Promega  Corp.,  Madison,  WI)  to  allow 
sequencing  of  the  entire  fragment  in  both  directions.  The 
Magic  Miniprep  DNA  Purification  System  (Promega  Corp., 
Madison,  WI)  was  used  to  prepare  double-stranded  plasmid  DNA. 
Sequencing  was  performed  by  the  dideoxynucleotide-chain 
termination  method  of  Sanger  et  al.  (1977)  using  fluorescent- 
labeled  primers  (forward,  5'-CACGACGTTGTAAAACGAC-3  '  ;  forward, 
5  '  -GGTTTTCCCAGTCACGACG-3  '  ;  reverse ,  5  '  -ATAACAATTTCACACAGGA-3  ' ) 
(LI-COR,  Inc.,  Lincoln,  NE)  and  the  Sequenase  7-deaza-dGTP  DNA 
Sequencing  kit  (United  States  Biochemical  Corp. ,  Cleveland, 
OH)  .  Regions  of  compression  were  resolved  by  using  either 
dITP  (United  States  Biochemical  Corp.,  Cleveland,  OH)  or  the 
TaqTrack  Deaza  Sequencing  System  (Promega  Corp.,  Madison,  WI)  . 
Sequencing  was  carried  out  on  a  LI-COR  model  4000  DNA 
sequencer  and  image  analyzer  using  7%  acrylamide  gels  (Long 
Ranger  Gel  Concentrate,  AT  Biochem,  Malvern,  PA) . 

Nucleotide  and  deduced  amino  acid  sequences  were 
manipulated  and  analyzed  with  the  Genetics  Computer  Group 
Sequence  Analysis  Software  Package  (University  of  Wisconsin, 
Genetics  Computer  Group,  Madison,  WI)  (Devereux  et  al.,  1984) 
and  the  National  Center  for  Biotechnology  Information  using 
the  BLAST  network  service  (Benson  et  al.,  1993).  Amino  acid 
sequences  were  aligned  by  using  the  Clustal  V  program 
(Higgins  et  al.,  1992).  The  nucleotide  sequence  data  has  been 
submitted  to  GenBank  and  assigned  the  accession  number  U07818. 
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Results 

Cloning  of  Genes  Encoding  MUC  and  MUG  Activities  (eel  Operon) 
The  plasmid  library  was  screened  for  activities 
hydrolyzing  MUC  and  MUG  after  transformation  into  DH5a. 
Approximately  50  positive  clones  were  identified  from  the 
50,000  colonies  tested.  Those  with  weak  activities  were 
unstable.  Nineteen  clones  were  found  which  exhibited  high 
activities  for  both  substrates.  These  activities  were  present 
together  in  all  clones.  Digestions  with  restriction  enzymes 
indicated  that  all  active  clones  contained  the  same  5  kbp  DNA 
fragment  and  were  siblings.  One  of  these,  denoted  pLOI902, 
was  selected  for  further  study. 

A  restriction  map  of  pLOI902  is  shown  in  Figure  3.  The 
minimal  coding  region  for  the  activities  hydrolyzing  MUC  and 
MUG  was  localized  by  testing  subclones.  Subclones  pLOI911, 
PLOI913,  and  pL0I915  were  constructed  by  deleting  Hindlll, 
Smal,  and  Wsil  fragments,  respectively.  All  but  pLOI915  were 
inactive.  This  subclone  retained  both  activities.  Subclones 
PLOI922,  pLOI923,  pLOI954,  and  pLOI955  were  constructed  by 
deleting  terminal  regions  with  exonuclease  III.  MUC  and  MUG 
activities  remained  after  the  deletion  of  ca.  400  bp  adjacent 
to  the  lac  promoter  (pLOI922)  or  1,500  bp  from  the  opposite 
end  (pLOI954) .  However,  further  deletion  of  375  bp  from 
pLOI922  or  264  bp  from  pL0I954  resulted  in  the  concomitant 
loss  of  both  activities  (pLOI923  and  pLOI955,  respectively). 
Thus  a  rather  large  region  of  DNA  (approximately  3.2  kbp) 
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appeared  to  be  required  for  the  expression  of  MUC  and  MUG 
activities  in  E.  coli  DH5a.  Both  activities  were  expressed  or 
lost  concurrently  in  all  subclones.  Active  clones 
consistently  hydrolyzed  MUG  more  rapidly  than  MUC. 

Experiments  were  conducted  to  determine  the  direction  of 
transcription  of  the  genes  encoding  MUC  and  MUG  activities  in 
pLOI902.  The  presence  or  absence  of  inducer  (isopropyl-thio- 
fi-D  galactopyranoside  or  IPTG)  did  not  alter  the  level  of 
expression  in  XLl-blue.  Attempts  to  subclone  this  fragment 
into  plasmid  pUC18  in  the  opposite  orientation  were 
unsuccessful  using  DH5a  as  a  host,  but  clones  were  readily 
obtained  using  XLl-Blue.  The  resulting  plasmid  (pLOI905)  was 
lethal  when  transformed  back  into  DH5a.  The  growth  of  XLl- 
Blue  harboring  pLOI905  was  strongly  inhibited  by  the  presence 
of  inducer  (IPTG) ,  consistent  with  partial  control  by  the 
resident  lacJq  in  the  F1  element.  Thus,  in  the  original 
clone,  pLOI902,  the  direction  of  transcription  for  the  eel 
operon  is  opposite  to  that  of  the  lac  promoter. 
Analysis  of  DNA  Sequence 

The  entire  5,026  bp  DNA  fragment  in  pLOI902  was  sequenced 
in  both  directions  (Fig.  3  and  4)  .  Four  complete  open  reading 
frames  were  found  in  the  region  required  for  the  two 
activities,  bordered  by  incomplete  genes  at  both  ends.  All 
appear  to  be  transcribed  in  the  same  direction.  The  four 
central  genes  appear  to  be  part  of  an  operon  (bases  1541 
through  4306)  with  a  potential  promoter  at  the  5'  end  and  a 


Figure  4 .  Nucleotide  sequence  of  the  5  kbp  DNA  fragment 
from  B.  stearothermophilus  contained  in  pLOI902.  The 
deduced  amino  acid  sequences  for  the  eel  operon  and  the 
incomplete  regulatory  gene  are  listed  below  the  first 
nucleotide  of  the  corresponding  codon.  A  putative 
promoter  region  for  the  eel  operon  is  underlined  with  the 
-35  and  -10  regions  labeled.  The  sequence  for  the  3' 
terminus  of  16s  rRNA  from  B.  stearothermophilus  (Van 
Charldorp  et  al.,  1981)  is  shown  above  the  potential 
ribosomal  binding  region  (underlined  and  labeled  RB)  from 
the  eel  genes.  Unmatched  bases  are  in  lower  case.  Genes 
within  the  eel  operon  are  marked  at  their  start  codons. 
Stop  codons  are  indicated  by  asterisks.  The  proposed 
terminator  is  marked  with  inverted  arrows  which 
correspond  to  a  proposed  stem-loop  structure. 
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1  TCCCGACGTCCTTGAGTTGACCGAAGCGATGGTCGTCTATGCGGAGAAAAAACTTGGGCG 

PDVLELTEAMVVYAEKKLGR 
6 1  CCGCCTCGCCGAGAAGGTGATGTATGCGCTCGCCATGCACATTCAAACGGCGATCAACCG 

RLAEKVMYALAMHIQTAINR 
121  CCTGCGGGCCGGGACGATCGTTTCCCATCCGAAACTGAATGAAGTTCGCGCCGCCTATAA 

LRAGTIVSHPKLNEVRAAYK 
181  GCAGGAATTCGCCGTCGCGCTCGACTGCCTTCAGCTGATGGAAGAAAGGACGAATATCGA 

QEFAVALDCLQLMEERTNID 
241  CTTCCCGATCGATGAAGCGGGATTTTTGACGATGTTTTTTGCGTTCCACGAGGAGCAAGC 

FPIDEAGFLTMFFAFHEEQA 
3  01  GGAAGAGCGGGAGGAACGGGTGGCGATCGTGGTTGTGATGCATGGCAACGGTGTGGCTTC 

EEREERVAIVVVMHGNGVAS 
361  GGCGATGGCGGACGTCGTGAATCAGCTGTTGGCCGCCGCGTGTGTGCATGCGGTCGATAT 

AMADVVNQLLAAACVHAVDM 
421  GCCGCTTGATGCCGACCCGAAACGAATTTACGAGCAGGTGAAAGCGGTGCTACAGCCCGT 

PLDADPKRIYEQVKAVLQPV 
481  CGCATCGAAAAAAGGGGCGCTGTTGCTTGTTGACATGGGATCGCTCGTGTCGTTTTCGAA 

ASKKGALLLVDMGSLVSFSN 
541  CTTTTTGGAAAAAGAGCTCGCTGTTCCGGTGCGGGTGATTTCCGCTGCCAGCACGCCGCA 

FLEKELAVPVRVISAASTPH 
601  CGTGCTCGAAGCGGCCCGCAAAGCGATGCTCGGCTATGCGCTTCAGGAAATTTACGAAGA 

VLEAARKAMLGYALQEIYEE 
661  AGTGAAAGCCGCAGCGCCGTTTTACATTCGCGGGCCGCTTTGGGAGGAGGAGGCTGCGGA 

VKAAAPFYIRGPLWEEEAAE 
721  GCAGGACATGCTGGCGATCGTCACCGCCTGTTTTACTGGAAAAGGAAGCGCGCTGGCGCT 

QDMLAIVTACFTGKGSALAL 
781  AAAGCATATATTGGAAACGTATTTGCAGCTCGATGAGCGCATGTGGCGCATTATTCCCAT 

KHILETYLQLDERMWRI  IPI 
841  TCAAATGGCAGATGCGGAAGAAGCGCGGCAAACCTTGTCCAACGTCGCGAAACATTTCCG 

QMADAEEARQTLSNVAKHFR 
901  CATCGTCTGTATCGTCAGCCATCTTTGCCTTGATGAGCGAATCCCGCACTTTTCTCTTGA 

IVCIVSHLCLDERIPHFSLE 
961  GGACGTGCTTAGCTTGAGGGCGATGAAAGAAATCCAAGCGTTAGCCGACGTTGAGGAAAT 

DVLSLRAMKEIQALADVEEI 
1021  TCATATGCATATGGCTAGAGAGCTGACCAATCATCTTCGCCACCTCGCACCGGcGCGGGC 

HMHMARELTNHLRHLAPARA 
1081  CATTCCCGCCATTCGCGCTGCGCTGGCGGCGATTGGTCGGGAACTCGGCCTTGAAGCGGA 

IPAIRAALAAIGRELGLEAD 
1141  TGGCCGCGATCTGGTGGGGCTTGTCTTTCATCTTTGCTGCTTGCTCGATCGCCTGCTGTC 

GRDLVGLVFHLCCLLDRLLS 
1201  CGGAGAGAGAACGAGCGGTGACCGAGGAGCAGCGAAAGCTTGCGGGCATGAAGATGAAGA 

GERTSGDRGAAKACGHEDED 
12  61  TGGCGCACTGTACAGGGCGGTCAAGGAGGGGTTGTTTCCGCTTGAACAGCAGTACGGTGT 

GALYRAVKEGLFPLEQQYGV 
1321  CTGCATCGATGAAGATGAACTGTGTCATATCGTTCACTTTTTTCGCTCCCTGCAAGGAAA 

CIDEDELCHIVHFFRSLQGK 
1381  ACGAGATGGATAAGGTGCGATAATTGGAAACGCTTCCATTACACAGTTTCCCTCCGTTAA 

R   D   G   * 

1441  CACACAACCGTTCCATCCGCTTCAGCGCTGTCTTTTCGGCGTTGGCATGGAATTTGCTTG 

UUUCCUCC  -35 

1501  AACAATAATGAATCATTGCAAAAAAAGGAGGAGGAAGCGGATGAACATTTTGCrrrATTTf: 
-10  RBS  M   N   I   L   L   I   C 

CelA  ■+ 
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1561  CGCTGCCGGCATGTCGACGAGTTTGCTTGTGACGAAGATGAAGGAGGCGGCAAAGCAAAA 

AAGMSTSLLVTKMKEAAKQK 
1621  AGGGATCGAGGCGAACATTTGGGCTGTGTCAGCTGATGAAGCGAAAAGTCATCTCGACCA 

GIEANIWAVSADEAKSHLDQ 
1681  GGCGGATGTTGTGCTGATTGGTCCGCAAATCCGTTATAAGCTCGCCGCCTTCAAAAAAGA 

ADVVLIGPQIRYKLAAFKKE 
1741    GGGAGAGGCGCGCGGCATTCCGGTTGACGTCATCAATCCAGCTGACTATGGGCGGGTCAA 
GEARGIPVDVINPADYGRVN 

UUUccUCCaC 
1801  CGGCGCCGGCGTGTTGGACTTTGCGCTGCGGTTAAAAAAATAAACAGGGGGTAGAAGCGG 

GAGVLDFALRLKK*     RBS 
1861  ATGGACCGGTTTATTCGGATGTTGGAAGACCGCGTGATGCCTGTCGCCGGCAAGATTGCC 
MDRFIRMLEDRVMPVAGKIA 
CelB  ■* 

1921  GAACAGCGCCATTTGCAGGCGATTCGTGACGGGATTATTTTGTCGATGCCATTGTTGATT 

EQRHLQAIRDGIILSMPLLI 
1981  ATCGGGTCTTTATTTTTAATCGTTGGCTTTTTGCCGATTCCCGGTTACAACGAATGGATG 

IGSLFLIVGFLPIPGYNEWM 
2  041  GCGAAATGGTTCGGCGAGCATTGGCTTGATAAGCTTCTCTATCCGGTTGGAGCGACGTTC 

AKWFGEHWLDKLLYPVGATF 
2101  GATATTATGGCGCTTGTTGTCAGCTTCGGAGTCGCCTATCGGCTGGCGGAAAAGTATAAA 

DIMALVVSFGVAYRLAEKYK 
2161  GTGGATGCGCTGTCCGCCGGCGCGATTTCACTGGCTGCTTTTTTGCTCGCAACCCCGTAT 

VDALSAGAISLAAFLLATPY 
2221  CAAGTGCCGTTCACGCCGGAAGGAGCGAAAGAAACCATTATGGTCAGCGGTGGCATCCCG 

QVPFTPEGAKETIMVSGGIP 
2281  GTGCAATGGGTCGGCAGCAAGGGGTTGTTTGTTGCCATGATTTTGGCGATTGTGTCAACC 

VQWVGSKGLFVAMILAIVST 
2341  GAAATTTACCGGAAAATCATTCAAAAAAATATTGTCATTAAGCTGCCGGACGGGGTGCCG 

EIYRKIIQKNIVIKLPDGVP 
2401  CCTGCTGTGGCCCGCTCCTTTGTTGCTTTGATCCCGGGAGCCGCCGTTCTCGTCGTTGTC 

PAVARSFVALIPGAAVLVVV 
2461  TGGGTAGCCCGGCTGATTTTGGAAATGACACCGTTTGAAAGTTTCCATAACATTGTATCT 

WVARLILEMTPFESFHNIVS 
2521  GTCCTCCTGAACAAACCGCTCAGTGTGCTCGGCGGCAGTGTATTTGGCGCCATTGTCGCC 

VLLNKPLSVLGGSVFGAIVA 
2581  GTGCTGCTTGTCCAGCTGCTATGGTCGACCGGACTGCACGGAGCGGCCATCGTCGGCGGG 

VLLVQLLWSTGLHGAAIVGG 
2641  GTCATGGGGCCGATCTGGCTGTCGCTGATGGATGAAAATCGAATGGTGTTCCAGCAAAAT 

VMGPIWLSLMDENRMVFQQN 
2701  CCGAATGCCGAACTGCCCAACGTCATTACGCAACAGTTTTTTGATCTTTGGATTTACATC 

PNAELPNVITQQFFDLWIYI 
2761  GGCGGTTCAGGAGCGACATTGGCGTTGGCGTTGACGATGATGTTTCGGGCCCGCAGCCGG 

GGSGATLALALTMMFRARSR 
2821  CAGCTGAAAAGCTTAGGTCGGCTGGCGATCGCGCCCGGCATTTTCAATATTAATGAGCCG 

QLKSLGRLAIAPGIFNINEP 
2881  ATCACGTTCGGCATGCCGATCGTCATGAACCCATTGCTGATCATTCCGTTCATTCTCGTG 

ITFGMPIVMNPLLIIPFILV 
2941  CCTGTCGTGCTTGTGGTTGTTTCCTACGCCGCGATGGCGACCGGGCTTGTCGCCAAACCA 

PVVLVVVSYAAMATGLVAKP 
3  001  AGCGGGGTGGCCGTGCCATGGACGACACCGATCGTGATCAGCGGCTATTTAGCGACGGGA 

SGVAVPWTTPIVISGYLATG 
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3061  GGCAAAATTTCCGGGAGTATTTTGCAAATCGTTAACTTCTTCATCGCGTTTGCCATCTAC 
GKISGSILQIVNFFIAFAIY 

3121  TATCCATTTTTCTCGATTTGGGACAAACAAAAAGCGGCCGAAGAGCAGGCCGATCCAACG 
YPFFSIWDKQKAAEEQADPT 

uuuccuc 

3181  ATTTCAAGCGGAGCGGGGGCAACGCACTCGCTGTAAAGGAGAGGAACGAAATGCCGCGCT 
ISSGAGATHSL*      RBS  M      P      R      Y 

CelC  ■* 

3241  ATTGCATCGTCAACGCCGATGATTTCGGTTACTCGAAAGGGGTCAACTACGGGATTTTGG 

CIVNADDFGYSKGVNYGILE 

3301  AAGCGTTTCAGAACGGTGTCGTCACGTCGGCGACGCTGATGGCGAATATGCCAGCGGCCG 

AFQNGVVTSATLMANMPAAE 

3361  AACACGCCGCCCGGCTGGCGAAGGACCATCCGGAACTCGGCGTTGGCATTCATTTTGTGC 

HAARLAKDHPELGVGIHFVL 

3421  TGACGTGCGGCCGGCCGCTGGCCGATGTTCCATCGCTGGTGAACGAGAATGGGGAGTTTC 

TCGRPLADVPSLVNENGEFP 

3481  CGCGGCGCGGGGAGGCGCTTGTCGGCGCTAGGCGCGGCGATATCGAGCGGGAGCTTTGCG 

RRGEALVGARRGDI   ERELCA 

3  541  CCCAATTGGAGCGTTTTTTCTCGTTCGGGCTCACTCCGACGCATATTGACAGCCATCATC 

QLERFFSFGLTPTHIDSHHH 

3  601  ACGTTCATGAGCACCCGAATGTGTTTCCGGTTGTGGAACAATTGGCCGAACGCTATCGGC 

VHEHPNVFPVVEQLAERYRL 

3661  TGCCGATCCGCCCGGTGCGGACGGCACGGCCGCATCGGCTGCCGACCGTCGACGTCTTTT 

PIRPVRTARPHRLPTVDVFF 

3721  TTCCGGATTTTTACGGCGATGGACTGACGAAAGACCGTTTTATCTCGCTGATCGACCGAA 

PDFYGDGLTKDRFISLIDRI 

3781  TTGGCGACGGGCAGACGGCGGAAGTGATGTGCCACCCGGCGTATATCGATGTTCCGCTCG 

GDGQTAEVMCHPAYIDVPLA 

3841  CGTCAGGAAGTTCCTATTGTCAACAACGAGTCGAAGAGCTGGCGGTGCTGACCGACCCAA 

SGSSYCQQRVEELAVLTDPT 

3901  CGCTCGTTGCCGAGATGGCCGAGCGCGGTGTTCAGCTGATCACGTACCGCGAATTCTATA 

LVAEMAERGVQLITYREFYK 

UCCUCcaCU 

3961   AACTATAGGAGAGGGCGTTATGCAAACGTATGAACAAACTGTATTCCAACTGATTCTTCA 

L      *         RBS  MQTYEQTVFQLILH 

CelD  -» 

4021  TGGCGGAAACGGCCGCAGTTATGCAATGGAGGCGATTACGGCGGCGAAAAAAGGGGAATT 

GGNGRSYAMEAITAAKKGEF 
4081  TGCCGAGGCGCGCAGGCTGCTTGAACAGGCGGGAGCAGAACTGCAGGCGGCCCATGGGCT 

AEARRLLEQAGAELQAAHGL 
4141  GCAGACCGCGTTGCTGCAACAAGAAGCGAGCGGCGGGCAGCCGGTGGTGACGCTTCTGAT 

QTALLQQEASGGQPVVTLLM 
4201  GGTGCATGCCCAAGATCATTTAATGACGGCGATCACGGTCAAGGATTTAGCCGCTGAATT 

VHAQDHLMTAITVKDLAAEF 
4261  CGTGGAGCTGTATGAAGCGCTCAAGCGGCAAACAACCGAATCATAATTGTCGTCGGCCGG 

VELYEALKRQTTES   * 
4321  CTTGTTGACCGGGTGAGGATGATTCGCTTCAACGGGAATGAACCCAAGGCGGGCGGCAAC 


4381  GGGCTGTTCTAGCGACCGCCCGCTTTGTTGATTTTACAATAATTTTATTATGTGAACATA 


4441  AGGGCGCCAGCCGTTCAGCCCTCCCGGCTGCCGCCGGTCTCGGCTGAACGGAGAAGGAAA 
4501  ACTCCGCTTGTTTGCGTCACATATCGTGGCTTGTGTTGTGCTTTTGCCGCGTGTGATTGC 
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4561  GGTTCTTCACTTTTTTCGAACCGTCAAGCGGCTCCGGCTGCCCAGGATTGTCGCCTTTTG 
4621  GGGCGTTTTTGCGCATATCTTTGCCGTCGTTTTTGTTCGTCATTTGTTGTCCCTCCTTTG 
4681  TCCGTTAGCATGCGGCGTTGTGCTTTTGCTTATTCGGGGCATAATTTGCGCCGGTTTGAA 
4741  ACGATAAGTAGTGGATGACCGCGATGACTGCGAAGGAGGAACGAGCCATGGTTTATCATA 
4801  AAACGAAGCAAGACGCGTTTCAGGCGCGCGCAAAAGGCGACGATGGAGGCAAAAGAGTGG 
4861  CATGACCATCTCGTCCGCGATCAAGCCGATTACGGCCATCAGTTAGCCCATTTGCGGCAG 
4921  GAAGTGAACGAAGCGTTTGCACAAATTGAAAACGCGCTTGAAGTCGCCTCGGAAACGCAG 
4981  CGCATGCAGCTTGAGAAATTCCGCAGCGATTTGCAGGCGATCCCCGG 
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possible  rho-independent  terminator  at  the  3 '  end  (Rosenberg 
and  Court,  1979)  .  The  coding  regions  for  all  four  genes 
utilize  ATG  codons  to  initiate  transcription.  The  potential 
ribosomal-binding  sequences  for  celA  and  celC  provide  a  better 
match  with  the  3'  terminus  of  16s  rRNA  from  B. 
stearothermophilus  (Van  Charldorp  et  al.,  1981).  It  is 
interesting  to  note  that  the  potential  ribosomal  binding 
sequences  in  celB,  celC,  and  celD  are  overlapped  by  stop 
codons  from  the  preceding  genes,  and  may  provide  translational 
coupling.  A  similar  phenomenon  was  known  in  B.  subtilis 
sucrose-specific  PTS  (Fouet  et  al.,  1987),  in  which  the  stop 
codon  for  enzyme  II  overlapped  the  start  codon  for  sucrose-6- 
phosphate-hydrolase . 
Identification  of  Cloned  Genes  by  Homology 

An  initial  search  of  the  DNA  data  base  revealed  strong 
homology  between  the  translated  sequences  of  the  cloned  genes 
and  previously  described  genes  encoding  sugar-specific 
proteins  in  PTS  systems  (Table  8)  .  The  highest  degree  of 
similarity  was  observed  with  the  E.  coli  eel  operon  (Table  8 
and  Fig.  5)  .  Since  MUC  and  MUG  can  be  considered  analogues  of 
cellobiose,  the  sequenced  genes  from  B.  stearothermophilus  XL- 
65-6  were  designated  as  a  eel  operon,  the  first  PTS  cellobiose 
system  to  be  characterized  at  a  molecular  level  in  a  Gram 
positive  organism.  Table  9  summarizes  some  of  the  properties 
of  these  four  genes.   The  gene  products  from  celA   and  celB 
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Table  8.  Comparison  of  predicted  amino  acid  sequences  for 
celA,  B,  and  D  and  celR'  from  B.  stearothermopMIus  to 
homologous  PTS  polypeptides  from  other  organisms. 


%Identitv 

f %Similaritv) 

Proteins 

Bs -CelAa 

Bs-CelBa 

Bs-CelDa               Bs 

-•CelRa 

EIIBC 

Ec-Ce lAb 

42.9    (59.2) 

LI  -LacEc 

23.2     (50.5) 

30.6    (58.2)k 

Sa-LacEd 

21.9    (51.0) 

29.4     (57.7)k 

Lc-LacEe 

16.0    (48.0) 

29.1    (55.9)k 

£c-CelBb 

32.9    (60.1) 

EIIA 

Bc-CelCb 

41.6    (67.3) 

Ll-LacFc 

41.8    (66.0) 

Sa-LacFd 

38.4     (62.6) 

Lc-FIIIe 

36.5    (58.9) 

Requlatory   proteins 

Bsu -LevRf 

29.5 

(50.1) 

Bsu-SacY9 

27.2 

(57.0) 

Bsu-SacTh 

30.0 

(58.0) 

£ch-ArbG' 

30.2 

(54.2) 

a  Bs-CelA,  Bs-CelB,  and  Bs-CelD  refers  to  polypeptides  for 

gene  celA,    celB,    and  celD   of  eel   operon  from  B. 

stearothermophilus .      Bs-'CelR  refers  to  the  incomplete 

regulatory  protein. 
b  £c-CelA,  .Ec-CelB,  and  £c-CelC  refer  to  EIIBCcl,  EIICCel,  and 

EIIACel  of  glucoside-specific  PTS  enzyme  II  of  eel   operon 

from  E.    coll    (Parker  and  Hall,  1990;  Reizer  et  al.,  1990). 
c  LI-LacE  and  Ll-LacF  refer  to  lactose-specific  PTS  operon 

EIIBCLac  and  EIIALac  from  Lactococcus  lactis ,   respectively  (de 

Vos  et  al.,  1990) . 
d  Sa-LacE  and  Sa-LacF  refer  to  lactose-specific  PTS  EIIBCLac 

and  EIIALac  of  lac-operon  from  Staphyloccus   aureus, 

respectively  (Breidt  et  al.,  1987). 
e  Lc-LacE  and  ic-FIII  refer  to  lactose-specific  PTS  EIIBCLac 

and  EIIALac  from  Lactobacillus   casei    respectively,  (Alfred 

and  Chassy,  1988;  Alfred  and  Chassy,  1990). 
f  Bsu-LevR  refers  to  transcriptional  regulator  of  PTS  levanase 

operon  of  B.  subtilis    (Debarbouille  et  al.,  1991).   The 

result  of  comparison  with  Bs-' celR  were  from  the  comparison 

of  amino  acid  2  to  459  with  amino  acid  475  to  935  of  levR 

gene  product. 
9  Bsu-SacY  refers  to  antiterminator  of  PTS  levansucrase  gene 

of  B.  subtilis    (Zukowski  et  al.,  1990).   The  result  of 

comparison  with  Bs-' celR  were  from  the  comparison  of  amino 
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Table  8  —  continued 

acid  2  to  115  with  amino  acid  68  to  181  of  sacY   gene 

product . 
h  Bsu-SacT  refers  to  transcriptional  antiterminator  of  PTS 

sucrase  operon  of  B.  subtilis    (Debarbouille  et  al.,  1990). 

The  result  of  comparison  with  Bs-' CelR  were  from  the 

comparison  of  amino  acid  2  to  101  with  amino  acid  66  to  165 

of  sacT   gene  product. 
'  Ech-hrbG   refers  to  antiterminator  of  PTS  phospho-6- 

glucosidase  operon  of  Erwinia   chrysanthemi    (El  Hassouni, 

1992)  .  The  result  of  comparison  with  Bs-' celR  were  from  the 

comparison  of  amino  acid  2  to  97  with  amino  acid  69  to  164 

of  arbG   gene  product. 
J  These  results  are  comparisons  of  Bs-CelA  with  carboxyl 

terminal  portions  of  Ll-LacE,  Sa-LacE,  and  ic-LacE, 

respectively . 
k  These  results  are  comparisons  of  Bs-CelB  with  amino  terminal 

portions  of  U-LacE,  Sa-LacE,  and  Lc-LacE,  respectively. 
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have  high  pi  values  and  positive  charges  while  those  from  celC 
and  celD  have  low  pi  values  and  negative  charges. 

The  translated  amino  acid  sequence  for  B. 
stBarothermophilus  celA  shares  43%  identity  and  59%  similarity 
with  that  of  celA  from  E.  coli  K12  (Parker  and  Hall,  1990) 
which  encodes  a  part  of  enzyme  II  (EIIB)  (Reizer  et  al.,  1990) 
(Table  8  and  Fig.  5A) .  In  many  organisms,  the  corresponding 
domains  encoded  by  celA  and  celB  are  fused  into  a  single 
polypeptide  (EIIBC)  (Saier  and  Reizer,  1992)  .  The  translated 
B.  stearothermophilus  celA  sequence  is  homologous  to  the 
carboxyl  terminal  portion  (EIIB  domain)  of  the  lactose- 
specific  enzyme  II  proteins  from  Lactococcus  lactis, 
Staphylococcus  aureus,  and  Lactobacillus  casei  (Table  8)  . 
Cys7  is  surrounded  by  a  highly  conserved  region  in  all 
homologues  examined.  This  Cys  residue  has  been  proposed  as 
the  site  of  phosphorylation  in  L.  lactis  (Alfred  and  Chassy, 
1990)  .  The  celA  gene  is  predicted  to  encode  a  protein  with  Mr 
10,737  (Table  9)  . 

The  celB  gene  begins  17  bases  downstream  from  celA  and  is 
preceded  by  a  potential  ribosomal-binding  site  which  overlaps 
the  last  2  bases  of  the  celA  stop  codon.  The  celB  gene  is 
predicted  to  encode  a  protein  with  Mr  48,805.  The  deduced 
amino  acid  sequence  for  celB  is  most  similar  to  that  of  the 
membrane-spanning  polypeptide  in  the  enzyme  II  complex  (EIIC) 
for  cellobiose  from  E.  coli  (celB)  (Parker  and  Hall,  1990), 
exhibiting  33%  identity  and  60%  similarity  (Table  8  and  Fig. 
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5B)  .  The  translated  sequence  for  B.  stearothermophilus  eel  B 
is  also  similar  to  the  amino  terminal  portion  of  genes 
encoding  PTS  enzyme  II  proteins  from  other  organisms  (Table 
8). 

The  celC  gene  begins  14  bases  downstream  from  the  celB 
stop  codon  and  also  exhibits  an  overlap  between  the  ribosomal- 
binding  region  and  the  preceding  stop  codon.  In  contrast  to 
the  other  genes  in  the  eel  operon,  no  genes  were  found  in  the 
current  data  base  with  similarity  to  the  translated  sequence 
for  celc.  However,  the  carboxyl-terminal  portion  of  this 
protein  exhibited  partial  homology  (22%  identity,  53% 
similarity)  with  the  carboxyl  portion  of  E.  coli  celF  encoding 
the  phospho-6-glucosidase.  Thus  the  celC  gene  is  presumed  to 
encode  a  cleavage  enzyme  for  cellobiose-phosphate. 

The  celD  gene  encodes  a  small  protein  containing  108 
amino  acids.  Again,  the  proposed  ribosomal  binding  site 
overlaps  the  stop  codon  of  the  preceding  gene.  The  translated 
amino  acid  sequence  for  celD  exhibits  41.6%  identity  and  67.3% 
similarity  to  that  of  B.  coli  celC  which  encodes  enzyme  III 
(EIIA)  (Table  8  and  Fig.  5C)  .  Table  8  summarizes  the  homology 
between  celD  and  related  genes  from  other  organisms.  A 
histidine  residue  (His76)  is  located  in  a  region  which  is 
highly  conserved  among  different  organisms.  This  residue  has 
been  proposed  as  the  site  of  phosphorylation  for  the  lactose- 
specific  enzyme  III  in  L.  casei  (Alfred  and  Chassy,  1988; 
Hengstenberg  et  al.,  1989). 
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An  incomplete  open  reading  frame  (Fig.  3  and  4)  was  found 
upstream  from  the  eel  operon  which  contained  463  codons  and  is 
transcribed  in  the  same  direction  as  the  eel  operon.  This 
gene  was  provisionally  identified  as  encoding  a  regulatory 
protein  (eel J?)  based  on  homology  to  regulatory  proteins  in 
other  PTS  systems  (Table  8)  .  The  translated  sequence  for  the 
incomplete  celR  gene  is  most  similar  to  that  of  B.  subtilis 
levR  (de  Vos  et  al.,  1990),  the  transcriptional  regulator  for 
the  PTS  levanase  operon.  Although  a  putative  promoter  for 
celABCD  was  readily  evident  in  the  intercistronic  sequence,  it 
is  unclear  whether  celR  is  part  of  the  eel  operon  or 
transcribed  separately.  No  stem-loop  structure  resembling  a 
rho-independent  terminator  was  found  in  the  intergenic  region 
and  it  is  possible  that  all  five  genes  may  be  transcribed  at 
the  some  level  from  a  common  promoter. 

A  portion  of  the  amino  terminus  of  a  possible  sixth  gene 
was  found  downstream  from  the  eel  operon  but  was  not 
identified  by  homology  searches  of  the  current  DNA  data  base. 
Codon  Usage 

Codon  usage  in  the  eel  genes  was  compared  to  that  for  the 
genes  encoding  phosphoglycerate  kinase  (Davies  et  al.,  1991) 
and  alcohol  dehydrogenase  T  (Sakoda  and  Imanaka,  1992)  from  B. 
stearothermophilus  (Table  10).  Many  codons  (ie.  AGA,  TGA, 
TAG,  ATA,  and  TGT)  were  rarely  used.  Either  a  G  or  C  was 
usually  present  in  the  wobble  positions  resulting  in  G+C 
contents  of  52.8  to  54.9  for  celAB  and  celD,   respectively. 
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Table  10. 

Codon 

Usage  of 

eel    Ope 

con 

Codon 

Mole 

AmAcid 

celA 

celB 

celC 

celD 

gap 

adhT 

Gly 

GGG 

2.00 

2.19 

2.57 

2.18 

0.62 

1.82 

GGA 

1.04 

2.02 

0.88 

1.85 

1.17 

1.82 

GGT 

1.04 

0.93 

0.88 

0.00 

0.28 

4.70 

GGC 

4.00 

3.29 

3.01 

2.81 

4.81 

2.35 

Glu 

GAG 

4.00 

0.67 

4.33 

2.75 

1.17 

0.61 

GAA 

1.00 

2.66 

3.01 

7.44 

5.70 

6.21 

Asp 

GAT 

1.98 

1.34 

3.03 

1.85 

2.09 

3.58 

GAC 

4.02 

1.10 

2.69 

0.00 

3.88 

1.46 

Val 

GTG 

4.00 

3.57 

2.87 

3.72 

4.18 

2.35 

GTA 

0.00 

0.71 

0.45 

0.94 

0.36 

5.09 

GTT 

2.00 

2.24 

2.42 

0.00 

2.03 

1.44 

GTC 

2.00 

3.77 

3.23 

0.94 

5.37 

4.18 

Ala 

GCG 

6.00 

4.69 

4.11 

9.29 

5.70 

1.82 

GCA 

1.05 

0.42 

0.34 

1.89 

1.16 

3.53 

GCT 

4.05 

0.94 

0.77 

0.94 

1.16 

2.93 

GCC 

4.05 

4.48 

3.34 

3.78 

3.61 

1.82 

Arg 

AGG 

0.00 

0.00 

0.41 

0.93 

0.00 

0.00 

AGA 

0.00 

0.00 

0.00 

0.00 

0.00 

0.00 

CGG 

2.00 

1.77 

3.67 

0.93 

0.31 

0.00 

CGA 

0.00 

0.22 

1.22 

0.00 

0.31 

0.00 

CGT 

1.00 

0.22 

0.82 

0.00 

0.31 

1.18 

CGC 

1.00 

0.90 

2.04 

1.85 

3.58 

1.50 

Ser 

AGT 

2.00 

0.90 

0.37 

0.92 

0.00 

0.00 

AGC 

0.00 

1.80 

0.37 

0.92 

0.59 

0.29 

TCG 

1.00 

1.14 

1.84 

0.00 

3.87 

0.59 

TCA 

1.00 

0.90 

0.73 

0.92 

0.00 

0.88 

TCT 

0.00 

0.42 

0.00 

0.00 

0.32 

0.29 

TCC 

0.00 

0.90 

0.37 

0.00 

0.59 

0.59 

Lys 

AAG 

4.00 

1.10 

0.54 

1.85 

0.89 

1.16 

AAA 

6.00 

1.38 

1.09 

1.85 

5.97 

7.14 

Asn 

AAT 

1.00 

1.33 

1.23 

0.00 

0.60 

0.59 

AAC 

3.00 

1.33 

1.63 

0.93 

5.37 

1.78 

Met 

ATG 

3.00 

3.77 

2.04 

3.70 

2.39 

0.89 

Table   10   —  continued 
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Codon 

Mole  % 

AmAcid 

celA 

celB 

celC 

celD 

gap 

adhT 

He 

ATA 

0.00 

0.00 

0.36 

0.00 

0.00 

0.54 

ATT 

5.04 

6.34 

1.89 

1.86 

2.09 

3.56 

ATC 

3.04 

3.77 

2.25 

0.92 

3.88 

1.78 

Thr 

ACG 

2.00 

2.00 

3.28 

4.67 

3.57 

0.29 

ACA 

0.00 

0.68 

0.00 

0.92 

0.62 

2.68 

ACT 

0.00 

0.00 

0.83 

0.92 

0.00 

0.29 

ACC 

0.00 

1.09 

0.83 

1.83 

1.47 

0.00 

Trp 

TGG 

1.00 

2.22 

0.00 

0.00 

0.60 

1.19 

End 

TGA 

0.00 

0.00 

0.00 

0.00 

0.00 

0.00 

TAG 

0.00 

0.00 

0.00 

0.00 

0.00 

0.00 

TAA 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

Cys 

TGT 

0.00 

0.00 

0.67 

0.00 

0.00 

0.59 

TGC 

1.00 

0.00 

1.37 

0.00 

0.60 

2.08 

Tyr 

TAT 

2.00 

1.34 

2.09 

2.78 

0.60 

2.09 

TAC 

0.00 

1.10 

1.58 

0.00 

1.79 

1.18 

Leu 

TTG 

3.00 

3.51 

1.56 

0.91 

5.64 

0.31 

TTA 

1.00 

0.90 

0.00 

1.81 

0.56 

3.86 

CTG 

2.00 

3.51 

4.67 

6.48 

0.89 

0.31 

CTA 

0.00 

0.23 

0.45 

0.00 

0.00 

0.00 

CTT 

1.00 

1.58 

0.90 

2.72 

0.32 

2.39 

CTC 

3.00 

1.58 

1.71 

0.91 

0.56 

0.93 

Phe 

TTT 

1.00 

2.88 

3.98 

0.92 

0.60 

1.18 

TTC 

1.00 

2.66 

1.33 

1.86 

0.80 

0.89 

Gin 

CAG 

0.99 

1.55 

0.94 

3.67 

0.00 

0.00 

CAA 

2.01 

1.55 

1.91 

6.52 

0.90 

3.56 

His 

CAT 

1.00 

0.67 

2.60 

3.70 

2.10 

2.67 

CAC 

0.00 

0.44 

1.89 

0.00 

0.59 

0.00 

Pro 

CCG 

2.01 

3.79 

5.76 

0.93 

3.28 

1.46 

CCA 

0.99 

1.35 

1.18 

0.00 

0.00 

0.91 

CCT 

0.00 

0.64 

0.00 

0.00 

0.00 

2.67 

CCC 

0.00 

0.64 

0.00 

0.00 

0.00 

0.00 

80 
The  G+C  content  of  celC   is  unusually  high  (59.1%)  and  differs 
significantly  from  other  genes  in  this  operon. 
Southern  hybridization  analysis 

Southern  analysis  confirmed  that  the  eel  genes  originated 
from  B.  stearothermophilus  XL-65-6.  A  Sty  I  fragment  from  the 
internal  region  of  the  eel  operon  was  used  as  a  probe.  This 
probe  did  not  hybridize  to  DNA  fragments  from  E.  coli  DH5a  and 
bound  single  regions  in  digestions  of  XL-65-6  genomic  DNA, 
consistent  with  the  eel  operon  being  present  as  a  single 
chromosomal  copy  (Fig.  6) . 
Hydropathy  analysis 

Analyses  of  the  deduced  amino  acid  sequences  by  the 
method  of  Kyte  and  Doolittle  (1982)  revealed  a  hydrophilic  or 
slightly  hydrophobic  character  for  the  products  of  celA,  celC , 
and  celD  (Fig.  7)  .  Further  analysis  for  membrane  spanning 
domains  by  the  method  of  Klein  et  al.  (1985)  predicted  a 
single  domain  in  the  eel  A  product  which  may  serve  as  a 
membrane  anchor.  No  membrane-spanning  domains  were  predicted 
in  the  celC  and  celD  products.  Hydropathy  profiles  for  the 
celA  and  celD  encoded  polypeptides  exhibit  a  remarkable  degree 
of  conservation  when  compared  to  homologous  PTS  enzymes  from 
other  organisms  despite  differences  in  the  transported  sugars 
(Fig.  7)  .  The  celB  gene  product  is  very  hydrophobic  with  9  to 
11  predicted  membrane-spanning  domains.  The  hydropathy  plot 
for  this  protein  is  also  very  similar  to  the  celB   homologues. 

Reizer  et  al.  (1990)  previously  noted  that  the  N-terminal 


Figure  6.  Southern  hybridization  analysis  of  eel  operon. 
Lane  assignment:  Lane  1  to  lane  5:  styl,  pstl,  Hindlll, 
EcoRl,  and  BssHII  digestions  of  B.  stearothermophilus  XL- 
65-6  genomic  DNA;  Lane  6:  ptsl  digestion  of  E.  coli  DH5a 
genomic  DNA;  Lane  7:  A.  marker. 
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region  of  E.   coli  celB   gene  product  and  the  C-terminal  region 

of  the  E.    coli    celC   translation  product  can  be  folded  into 

helical  wheel  plots  to  reveal  distinctly  hydrophobic  and 

hydrophilic  faces.   A  similar  analysis  of  the  corresponding 

regions  in  the  translated  sequences  for  B.  stearothermophilus 

celB     and  celD     confirmed  that  these  patterns  were  also 

maintained  in  this  Gram  positive  organism  (Fig.  8)  .   When 

folded,  the  first  16  amino  acids  at  the  amino  terminus  of  the 

B.  stearothermophilus  celB   encoded  protein  produced  a  highly 

hydrophobic  face  which  may  be  involved  in  the  formation  of  the 

transport  complex  but  is  not  predicted  to  span  the  membrane 

(Klein  et  al.,  1985).     Similarly,  the  carboxyl  terminal 

region  of  the  celD    gene  product  (close  to  the  histidinyl 

residue  of  the  proposed  phosphorylation  site)  can  also  be 

folded  into  an  a-helix  in  which  most  hydrophobic  residues  are 

segregated  on  one  side  and  may  function  in  the  assembly  of  the 

active  enzyme  II  complex. 

Functional  analysis  of  genes  in  the  B.  stearothermophilus  eel 
operon  in  E.   coli 

The  B.    stearothermophilus   eel   genes  were  constitutively 

expressed   in   strain   DH5a(pLOI902) .     Recombinant   cells 

hydrolysed  MUC,  MUG,  and  p-nitrophenyl-glucopyranoside  (pNPG)  , 

and  formed  slight  pink  colonies  when  growing  on  cellobiose- 

MacConkey  agar,  indicating  poor  utilization  of  cellobiose. 

All  activities  were  concurrently  lost  as  a  result  of  deletions 

in  the  carboxyl  terminal  region  of  celD    and  frame-shift 

mutations  in  celB   (Fig.  3)  .  The  full  insert  was  subcloned  in 
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the  reverse  orientation  to  provide  transcription  from  the  lac 
promoter.  Deletions  in  the  amino  terminal  region  of  celA  also 
resulted  in  a  loss  of  activity  (Fig.  3).  A  variety  of 
approaches  were  tried  to  isolate  mutations  in  the  celC  gene 
without  success.  Thus  a  requirement  for  three  of  the  four 
genes  within  the  eel  operon  has  been  demonstrated  while  the 
fourth  gene,  celC,  is  presumed  to  encode  a  cleavage  enzyme  for 
cellobiose-phosphate. 

Cells  of  strain  DH5a(pLOI902)  exhibited  strong  MUC,  MUG 
and  pNPG  activities  when  incubated  at  37°C.  However,  little 
activity  was  detected  when  cells  were  incubated  at  55°C 
despite  the  thermal  tolerance  of  B.  stearothermophilus . 
Previous  studies  had  demonstrated  that  the  general  proteins 
(EI  and  HPr)  of  the  E.  coli  PTS  complement  sugar-specific 
components  from  other  Gram  positive  bacteria  (de  Vos  et  al., 
1990;  Fouet  et  al.,  1987).  The  loss  of  activity  at  55°C  could 
result  from  thermal  inactivation  of  essential  host-supplied 
components  (enzyme  I) .  To  test  this  hypothesis,  a  ptsl  mutant 
of  E.  coli,  MM6  (Fraenkel  et  al.,  1964),  was  transformed  with 
PLOI902.  The  resulting  recombinant  was  inactive  at  both 
temperatures . 

The  requirement  for  host  EI  (and  presumably  also  HPr)  for 
activity  at  both  temperatures  indicates  that  the 
phosphorylation  of  chromogenic  substrates  may  be  essential 
prior  to  cleavage. 
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Discussion 

The  chromogenic  substrate  MUC  has  often  been  used  to 
measure  exocellobiohydrolase  activity,  although  potential 
confounding  activities  from  B-glucosidases  are  well  known 
(Coughlan,  1988).  However,  the  discovery  that  MUC  and  MUG 
activities  could  be  used  as  a  marker  for  the  cloning  of  a 
cellobiose  PTS  operon  from  a  Gram  positive  bacterium,  B. 
stearothermophilus ,  was  rather  unexpected. 

The  organization  of  the  PTS  has  been  described  by  many 
researchers  (Saier  and  Reizer,  1992).  This  system  consists  of 
two  general  energy-coupling  proteins,  enzyme  I  and  HPr,  as 
well  as  a  sugar-specific  permease,  commonly  referred  as  the 
enzyme  II  complex.  Although  the  enzyme  II  complex  may  consist 
of  1,  2,  3,  or  4  distinct  polypeptide  chains,  each  complex 
contains  at  least  three  functional  domains:  a  hydrophobic 
trans-membrane  domain  which  binds  and  transports  the  sugar,  a 
closely  associated  hydrophilic  domain  which  possesses  the 
first  phosphorylation  site,  and  a  second  hydrophilic  domain 
containing  an  additional  phosphorylation  site.  In  B. 
stearothermophilus,  celB  encodes  the  membrane-spanning 
polypeptide  which  forms  the  transmembrane  channel.  The  celA 
product  is  less  hydrophobic  and  contains  a  hydrophobic  tail 
which  may  serve  as  a  membrane  anchor.  The  celA  product  is 
predicted  to  have  a  high  pi,  similar  to  that  of  the  celB 
product.  Since  celA  and  celB  encode  domains  which  form  a 
single  polypeptide  in  many  organisms  (Saier  and  Reizer,  1992), 
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these  two  gene  products  can  be  assumed  to  interact  closely  in 
the  enzyme  II  complex.  The  celD  product  is  hydrophilic 
although  a  hydrophobic  surface  appears  to  be  present  near  the 
carboxyl  terminus.  This  gene  product  is  predicted  to  have  a 
low  pi  which  may  allow  the  formation  of  ionic  interactions 
with  both  the  celA  and  celB  encoded  polypeptides.  The  celC 
product  is  also  very  hydrophilic  and  is  proposed  to  encode  an 
enzyme  for  the  cleavage  of  cellobiose-phosphate,  based  on 
homology  with  E.  coli  celF.  All  attempts  to  mutate  this  gene 
were  unsuccessful. 

In  recombinant  E.  coli  expressing  MUC  and  MUG  activities, 
enzyme  I  (and  presumably  HPr)  must  be  supplied  by  the  host  and 
complement  the  PTS  genes  from  B.  stearothermophilus .  Other 
examples  of  functional  complementation  with  heterologous  PTS 
systems  from  Gram  positive  organisms  have  been  reported 
previously  in  recombinant  E.  coli  (de  Vos  et  al.,  1990;  Fouet 
et  al. ,  1987) . 

MUC,  MUG,  and  pNPG  are  analogues  of  cellobiose  and 
require  hydrolysis  to  release  the  respective  chromogens. 
Since  a  functional  E.  coli  enzyme  I  (ptsJ)  was  required  for 
hydrolysis,  phosphorylation  can  be  presumed  to  be  essential. 
In  the  simplest  case,  the  phosphorylated  intracellular  product 
must  be  hydrolysed  once  to  release  the  chromogen  from  MUG  or 
twice  to  release  the  chromogen  from  MUC.  However,  it  is 
possible  that  the  transported  substrate  is  cleaved  by  a 
phosphorylase  as  proposed  for  Cellulomonas  uda   and  C.  favigena 
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(Coughlan  and  Mayer,  1992) .  A  phosphorylase  would  produce  4- 
methylumbelliferyl-phosphate  or  p-nitrophenyl-phosphate  as 
products  which  must  be  cleaved  by  a  host  phosphatase  to 
release  the  chromogen.  The  precise  nature  of  the  reaction 
catalyzed  by  the  proposed  cleavage  enzyme  (celC)  is  still 
unknown . 

The  PTS  cellobiose-specific  proteins  from  the  Gram 
positive  thermophile  B.  stearothermophilus  (celA,  celB,  and 
celD)  appear  remarkably  similar  to  the  corresponding  genes 
from  the  Gram  negative  mesophile,  E.  coli  celA,  celB,  and  celC 
(Parker  and  Hall,  1990)  and  may  be  derived  from  a  common 
ancestor.  Indeed,  the  sugar-specific  proteins  from  most 
organisms  which  have  been  examined  exhibit  a  high  degree  of 
homology  despite  differences  in  the  transported  substrates 
(Saier  and  Reizer,  1992).  However,  the  proposed  phospho-6- 
glucosidases  from  these  two  organisms  (celC  in  B. 
stearothermophilus  and  celD  in  E.  coli,  respectively)  exhibit 
only  modest  homology  to  each  other  and  little  homology  to 
other  glycohydrolases,  consistent  with  an  independent  origin. 
The  independent  origin  of  celC  is  further  supported  by  the 
high  G+C  composition  of  this  gene,  considerably  higher  than 
that  of  other  sequenced  genes  from  B.  stearothermophilus .  The 
discovery  of  a  PTS  eel  operon  in  B.  stearothermophilus  which 
is  similar  to  that  in  E.  coli  (Parker  and  Hall,  1990)  is 
consistent  with  the  PTS  being  the  prevalent  route  for 
cellobiose  uptake  in  bacteria. 
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Transcription  of  B.  stearothermophilus  eel  operon  may  be 
regulated  by  a  mechanism  comparable  to  that  for  lev  operon  in 
B.  subtlis.  Transcription  of  B.  subtilis  lev  operon,  a  PTS 
levanase  operon,  is  controlled  by  a  regulator,  LevR.  LevR 
functions  as  a  repressor  in  the  uninduced  state  and  as  an 
activator  after  induction  with  fructose.  The  gene  encoding 
this  regulator  is  located  upstream  of  the  native  promoter  of 
lev  operon  (Debarbouille  et  al. ,  1991).  B.  stearothermophilus 
'CelR  exhibits  homology  to  carboxyl  terminal  portion  of  B. 
subtilis  LevR,  and  genes  encoding  these  two  enzymes  are 
arranged  in  a  comparable  pattern.  Therefore,  CelR  is  expected 
to  have  a  similar  function  to  LevR.  'CelR  is  also  homologous 
to  other  transcriptional  antiterminators  from  B.  subtilis 
(Debarbouille  et  al.,  1990;  Zukowski  et  al.,  1990).  Thus  CelR 
may  function  as  a  transcriptional  antiterminator  in  B. 
stearothermophilus . 


CHAPTER  5 

CLONING  AND  SEQUENCING  OF  THE 

PHOSPHOENOLPYRUVATE-DEPENDENT  PHOSPHOTRANSFERASE 

GENES  (PTSHIT)    FROM  BACILLUS   STEAROTHERMOPHILUS 

AND  HYPEREXPRESSION  IN  ESCHERICHIA   COLI 


The  phosphoenolpyruvate-dependent  phosphotransferase 
system  (PTS)  is  a  complex  carbohydrate  uptake  system  which  has 
a  variety  of  physiological  functions  in  bacteria  (Postma  et 
al.,  1993;  Saier  and  Peizer,  1992).  This  system  consists  of 
two  cytoplasmic  energy-coupling  proteins,  enzyme  I  (EI)  and 
histidine-containing  phosphocarrier  protein  (HPr) ,  and  sugar- 
specific  permeases  referred  to  as  enzyme  II  complexes  (EII) . 
The  PTS  functions  to  transfer  a  phosphoryl  group  from 
phosphoenolpyruvate  (PEP)  to  EI,  to  HPr,  and  then  to  the  EII 
complex.  The  EII  complex  catalyzes  carbohydrate  uptake  with 
concomitant  phosphorylation. 

In  the  previous  chapter,  a  cellobiose  PTS  operon  (eel 
operon)  from  an  isolate  of  Bacillus  stearothermophilus ,  strain 
XL-65-6  was  described.  Cellobiose  utilization  by  Escherichia 
coli  recombinants  harboring  this  operon  was  poor  (slight 
acidification  of  MacConkey  agar  containing  cellobiose) 
although  clones  were  easily  identified  using  chromogenic 
cellobiose  analogues.  Since  Els  from  different  organisms  have 
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been  reported  to  complement  poorly  in  heterologous  systems 
(Postma  et  al.,  1993),  the  inability  of  recombinant  E.  coli  to 
utilize  cellobiose  efficiently  may  result  from  poor 
compatibility  between  the  native  general  proteins  (EI  and  HPr) 
and  the  cellobiose-specif ic  B.  stearothermophilus  gene 
products. 

The  aim  of  this  objective  was  the  cloning  and  sequencing 
ptsH  and  ptsl  genes  from  B.  stearothermophilus   strain  XL-65-6, 
and  improvement  of  cellobiose  utilization  in  recombinant  E. 
coli   by  combining  eel   operon  and  ptsHI   operon. 
Materials  and  Methods 

Bacterial  Strains.  Plasmids.  and  Media 

Bacterial  strains  and  plasmids  used  in  this  study  are 
listed  in  Table  11.  Luria  broth  (or  agar)  were  used  for 
growth  and  maintenance  of  E.  coli  (Atlas  and  Parks,  1993) . 
Acidification  of  Difco  MacConkey  agar  base  plus  1.0% 
carbohydrate  was  used  as  a  test  for  carbohydrate  utilization. 
Complementation  of  E.  coli  ptsl  mutations  was  evaluated  using 
MacConkey  agar  containing  each  of  5  PTS  carbohydrates 
(fructose,  glucose,  mannose,  mannitol,  or  sorbitol). 
Sorbitol-MacConkey  agar  was  used  to  test  functional 
complementation  of  the  ptsH  mutation  in  E.  coli  1101. 
Carbohydrate  utilization  was  also  examined  in  M9  minimal 
medium  supplemented  with  0.2%  carbohydrate  (Atlas  and  Parks, 
1993).  Ampicillin  (50  nq  ml"1)  was  added  to  the  media  as 
appropriate  for  the  selection  of  recombinants. 
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DNA  Manipulation 

Standard  procedures  were  used  for  the  construction, 
isolation,  and  analysis  of  plasmids  (Sambrook  et  al.,  1989). 
Commercial  restriction  enzymes  were  used  as  recommended  by 
each  manufacturer. 
Cloning  of  B.   stearothermophilus  ptsHIT   Operon 

Clones  containing  the  XL-65-6  ptsH  and  ptsl  genes  were 
isolated  by  transforming  an  E.  coli  ptsl  mutant,  strain  MM6 
(Fraenkel  et  al.,  1964),  with  a  previously  constructed  pUC18- 
based  genomic  library.  Transformants  were  screened  for  the 
utilization  of  PTS  sugar  by  plating  on  MacConkey  agar 
containing  fructose.  Deep  red  colonies  were  selected  after 
overnight  incubation  at  3  0°C  and  purified  for  further 
investigation.  E.  coli  DH5a  was  used  as  the  host  for 
subcloning  experiments. 
Southern  Hybridization 

Restriction  endonuclease  digestions  of  genomic  DNA  from 
strain  XL-65-6  were  prepared  using  clal,  EcoRI,  PstI,  Sail, 
and  Smal.  After  separation  by  agarose  gel  electrophoresis, 
fragments  were  transferred  to  Zeta-Probe  GT  membranes  (Bio-Rad 
Laboratories,  Richmond,  Calif.)  and  probed  with  a  1.3-kbp 
EcoRI  fragment  spanning  parts  of  ptsH  and  ptsJ  from  B. 
stearothermophilus  XL-65-6.  This  probe  was  labeled  with 
digitoxigenin  (Genius  System;  Boehringer  Mannheim 
Biochemicals,   Indianapolis,   Ind.)   and  hybridized  at  55°C 
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(Sambrook  et  al.,  1989).   Genomic  DNA  from  E.    coli   DH5a  was 

digested  with  PstI  and  served  as  a  negative  control. 

DNA  Sequencing  and  Sequence  Analysis 

DNA  sequencing  and  sequence  analysis  were  as  described  in 

the  previous  chapter.   The  nucleotide  sequence  data  has  been 

submitted  to  GenBank  and  assigned  the  accession  number  U12340. 

Sodium  Dodecyl  Sulfate  Polyacrvlamide  Gel  Electrophoresis 
fSDS-PAGE) 

Cultures  of  E.    coli   JLT2  harboring  recombinant  plasmids 

were  grown  in  Luria  broth  with  shaking  at  30°C  and  37°C 

overnight.  Cells  were  harvested  by  centrifugation  and  washed 

twice  with  10  mM  sodium  phosphate  buffer  (pH  7.0).   Cell 

pellets  were  resuspended  in  an  equal  volume  of  this  buffer  and 

disrupted  by  two  passes  through  a  French  pressure  cell  at 

20,000  lb  in"2.   Cell  debris  was  removed  by  centrifugation 

(13,000  X  g,  10  min) .   Nucleic  acid  was  digested  by  adding 

CaCl2  (2  mM  final  concentration)  and  50  ^9  ml"1  DNase  (1  h, 

22°C) .     Denatured  proteins  were   separated   by   SDS-PAGE 

(Schagger  and  von  Jagow,  1987)  and  stained  with  Coomassie 

blue. 

Results 

Cloning  of  the  ptsHIT  Operon  from  B.  stearothermophilus   XL-65- 
6 

Clones  containing  B.  stearothermophilus  XL-65-6  ptsl  gene 

were  readily  isolated  by  functional  complementation  of  a  ptsl 

mutation  in  E.     coli    MM6  provided  that  the  MacConkey  agar 

selection  plates  containing  ampicillin  were  incubated  at  30°C. 
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Initial  attempts  using  MacConkey  agar  with  glucose  or  fructose 
for  the  cloning  were  not  successful  when  the  culture  was 
incubated  at  37°C.  Plasmids  purified  from  all  six  positive 
clones  were  stable  in  E.  coli  DH5a  and  restored  the  ability  of 
MM6  to  utilize  5  PTS  carbohydrates  at  30°C.  Although  these 
MM6  recombinants  also  formed  red  colonies  at  37°C  indicating 
complementation,  growth  was  severely  limited.  Based  upon  an 
analysis  of  restriction  sites,  plasmids  in  5  clones  appeared 
identical,  with  a  DNA  insert  of  6.4  kbp,  while  the  other  was 
slightly  smaller  (5  kbp) .  One  of  the  larger  plasmids,  denoted 
pLOI1800,  was  selected  for  further  study  (Fig.  9A) . 

Transformation  of  pLOT1800  into  a  ptsH  E.  coli  mutant, 
strain  1101,  also  restored  PTS  function  of  this  mutant 
indicating  the  presence  of  ptsH   gene  in  pLOI1800. 

A  series  of  subclones  was  constructed  to  localize  these 
genes  (Fig.  9A)  .  Plasmid  pLOI1801  was  constructed  from 
pLOI1800  by  deleting  the  BamHI  to  Sad  (polylinker  region) 
fragment  and  retained  both  activities.  Both  activities  were 
lost  after  deletion  of  the  PstI  (polylinker  region)  to  SphI 
fragment  (pLOI1802) .  Plasmid  pLOI1803,  an  exonuclease  III 
deletion  of  pLOI1800,  retained  both  activities.  After 
deletion  of  the  terminal  Sail  (one  Sail  site  in  the  polylinker 
region)  fragment  in  pLOI1801,  both  activities  were  retained 
(pLOI1806) .  By  making  a  similar  Sail  deletion  in  pLOI1803  to 
produce  pLOI1804,  the  coding  regions  for  both  enzymes  were 
localized  to  the  2.3  kbp  region.   EI  activity  was  eliminated 
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by  deleting  5  kbp  from  pLOI1800  with  exonuclease  III,  although 
HPr  function  was  retained  (pLOI1805) . 

Using  the  internal  1.3  kbp  EcoRI  fragment  (Fig.  9A)  from 
pLOI1800  as  a  probe,  Southern  analysis  confirmed  that  the 
cloned  DNA  was  derived  from  B.  stearothermophilus  XL-65-6. 
This  probe  hybridized  to  a  single  band  in  each  digestion  of 
XL-65-6  genomic  DNA  but  did  not  hybridize  to  E.  coli  DNA  (Fig. 
10)  .  These  results  are  consistent  with  the  presence  of  a 
single  chromosomal  copy. 
DNA  Sequence  Analysis  and  Gene  Identification 

The  5,211  bp  region  in  pLOI1801  was  sequenced  in  both 
directions  by  using  a  combination  of  nested  deletions  and 
overlapping  subclones  (Fig.  11)  .  Five  open  reading  frames 
(ORFs)  were  found  in  the  sense  strand,  four  of  which  were 
complete.  All  appear  to  be  transcribed  opposite  to  that  of 
the  lac   promoter  on  the  vector  (Fig.  9) . 

The  central  region  of  plasmid  pLOI1801  contains  ptsH , 
ptsl     and  ptsT.  The  flanking  ORFs  were  provisionally 

identified  by  amino  acid  homology  to  known  genes  from  B. 
subtilis  and  corresponding  gene  designations  were  retained. 
The  translated  5'  incomplete  ORF  (324  amino  acids)  is  similar 
to  the  carboxyl-terminus  of  glucose-specific  EII  {ptsG1 ;  67% 
identity;  81%  similarity)  (Zagorec  and  Postma,  1992) .  The 
translated  3'  flanking  gene  (254  amino  acids)  is  similar  to 
the  amino-terminal  end  of  a  precursor  for  wall  associated 
protein  (wapA;     46%  identity  and  63%  similarity)  (Foster, 


Figure  10.  Southern  hybridization  analysis  of  ptsHIT 
operon.  Lane  assignment:  Lane  1:  A  marker;  Lane  2:  pstl 
digestion  of  E.  coli  DH5a  genomic  DNA;  Lane  3  to  Lane  7: 
digestions  of  B.  stearothermophilus  XL-65-6  genomic  DNA 
with  Clal,  EcoRI,  pstl,  Sail,  and  Smal,   respectively. 


103 


1      2     3      4      5      6     7 


Figure  11.  Nucleotide  sequence  of  B.  stearothermophilus 
DNA  fragment  in  pLOI1801.  The  deduced  amino  acid 
sequences  for  each  ORF  are  placed  below  the  first 
nucleotide  of  the  corresponding  codon.  The  putative 
promoter  region  is  underlined  and  the  -10  and  -35  regions 
labeled.  Regions  complementary  to  UUUCCUCC,  the  3'  end 
of  16s  rRNA  from  B.  stearothermophilus  (Van  Charldorp  et 
al.,  1981),  are  underlined  and  labeled  RB.  Encoded 
proteins  are  labeled  at  their  respective  start  codons. 
Stop  codons  are  indicated  by  asterisks. 
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1  GCACTTGTTGAACGTGAAAATCGGGATGACCTTCTCCGGCGGGGTCATTGACTTCTTGCT 
HLLNVKIGMTFSGGVIDFLL 
6 1  GTTTGGCGTTCTGCCAAACCGGACGGCATGGTGGCTCGTCATTCCGGTCGGCTTGGTGTT 
FGVLPNRTAWWLVIPVGLVF 
121  TGCCGTGATTTATTATTTCGGGTTCCGCTTTGCGATCCGCAAATGGGATTTGGCGACGCC 

AVIYYFGFRFAIRKWDLATP 
181  GGGCCGGGAAAAAACGGTCGAGGAAGCACCGAAAGCCGAGGCGGCCGCTGCTGGCGATCT 

GREKTVEEAPKAEAAAAGDL 
241  GCCGTACGAAGTGCTCGCCGCCCTTGGCGGAAAGGAAAACATCGAACATTTGGACGCATG 

PYEVLAALGGKENIEHLDAC 
301  CATTACGCGTTTGCGCGTTTCGGTCCATGATATCGGCCGGGTCGATAAAGACCGGTTGAA 

ITRLRVSVHDIGRVDKDRLK 
361  AGCGCTCGGCGCCGCAGGGGTGCTTGAAGTCGGCAACAACGTGCAAGCGATTTTCGGTCC 

ALGAAGVLEVGNNVQAI   FGP 
421  GAAATCGGATATGTTAAAAGGGCAAATTCAAGACATCATGCAAGGAAAAGCGCCGGCCCG 

KSDMLKGQIQDIMQGKAPAR 
481  GGCGGAAGAGAAACCGAAAACAGCGGCCTCGGAAGCCGCGGAGTCCGAAACGATCGCTTC 

AEEKPKTAASEAAESETIAS 
541  GCCGATGTCCGGAGAAATCGTTCCGCTCGCTGAAGTGCCGGACCAAGTGTTTTCGCAAAA 

PMSGEIVPLAEVPDQVFSQK 
601  AATGATGGGCGATGGGTTCGCAGTCATGCCGACGGACGGCACGGTCGTCTCTCCGGTTGA 

MMGDGFAVMPTDGTVVSPVD 
661  TGGGAAGATCATCAACGTCTTTCCGACGAAGCACGCCATTGGCATTCAGTCGGCCGGCGG 

GKIINVFPTKHAIGIQSAGG 
721  GCATGAAATTTTGATCCACGTCGGCATCGATACGGTGAAGCTGAACGGCCAAGGGTTTGA 

HEILIHVGIDTVKLNGQGFE 
781  AGCGCTTGTGAAGGAAGGCGACGAAGTGAAAAAAGGACAGCCGATTTTGCGCGTCGACCT 

ALVKEGDEVKKGQPILRVDL 
841  TGATTATGTCAAACAGAACGCCCCATCGATCGTCACACCGGTGATTTTCACGAATCTCCA 

DYVKQNAPSIVTPVIFTNLQ 
901  AGCCGGCGAAACGGTTCACGTCAACAAGCAAGGCCCGGTTGCTCGAGGAGAAGACGCTGT 

AGETVHVNKQGPVARGEDAV 
961  TGTGACGATCCGCTAAACTGCTTATTCGCGGCGGGGCGGGACCAAGCATTTTCCGTTGCC 
V   T   I   R   *  -35 

1021  TCCCCGCCCCTGCATTTGATATGATGGCAAAGAAAAAATTGAAGAAAAGGGGAATGATGA 

-10  RS 

1081  ACATGGCAGAAAAAACGTTTAAAGTCGTTTCTGATTCCGGCATCCACGCTCGTCCGGCGA 
MAEKTFKVVSDSGIHARPAT 
HPr  - 
1141  CGATTTTGGTGCAAACGGCGAGCAAATTCAACAGCGAAATCCAGCTTGAGTACAACGGCA 
ILVQTASKFNSEIQLEYNGK 
1201  AAACGGTGAACTTGAAATCGATCATGGGCGTTATGTCGTTAGGAATTCCGAAAGGGGCGA 
TVNLKSIMGVMSLGIPKGAT 
12  61  CGATCAAAATCACGGCAGAAGGGGCCGATGCGGCAGAAGCAATGGCGGCGTTAACGGATA 
IKITAEGADAAEAMAALTDT 
1321  CGTTGGCGAAAGAGGGTCTTGCAGAATAATGGGAAACGCAATCCGCGAAAAAACGATCCA 
LAKEGLAE*MGNAIREKTIH 
RS  EI  -> 

1381  TGGGATTGCTGCATCGAGCGGTATTGCCATCGCGAAGGCATACCGCTTAGAAACCCCTGA 

GIAASSGIAIAKAYRLETPD 
1441    TTTGGCAGCCGAAAAACGGACGGTCGCCGACGTCGAGGCGGAAATTGCGCGGCTTGAGGC 
LAAEKRTVADVEAEIARLEA 
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1501  GGCGGTCGCGAAAGCGAAAGAAGAGCTGGAAGCCATCAAACAGCATGCCTTGGAAAAGCT 

AVAKAKEELEAIKQHALEKL 
1561  TGGCGAAGACAAAGCTGCCATTTTTGCCGCCCACTTGCTTGTGCTTGACGACCCAGAATT 

GEDKAAIFAAHLLVLDDPEL 
1621  GCTGAACCCGATTAAAGAAAAAATCCAAACGGAGCGCGTCAATGCCGAGTACGCTCTCGA 

LNPIKEKIQTERVNAEYALD 
1681  TGAAACCGCCTTGTTTTTCATCTCGATGTTCGAGGCGATGGACAATGAATATATGAAAGA 

ETALFFISMFEAMDNEYMKE 
1741  ACGGGCCGCTGACATCCGCGATGTGACAAAGCGCGTCCTCGCCCATCTGCTTGGCGTCAC 

RAADIRDVTKRVLAHLLGVT 
1801  AATCTCGAACCCGAGCCTCATTTCTGAGGAAGTCGTGATCATCGCGGAAGACTTGACGCC 

ISNPSLISEEVVIIAEDLTP 
1861  ATCCGATACGGCGCAGCTGAACCGCCAATATGTGAAAGGATTTGCCACCGACATCGGCGG 

SDTAQLNRQYVKGFATDIGG 
1921  GCGGACGTCGCATTCAGCAATTATGGCCCGTTCGCTCGAAATCCCTGCCGTCGTCGGCAC 

RTSHSAIMARSLEIPAVVGT 
1981  GAAGACGGTGACGGCGGAAGTAAAAAACGGCGACATCGTCATTGTCGATGGGCTCGACGG 

KTVTAEVKNGDIVIVDGLDG 
2041  CCAAGTAATCATCAATCCATCGCCGGAGTTGCTCGCCCAATATGAACAAAAACGGGCGCG 

QVIINPSPELLAQYEQKRAR 
2101  CTACGAGGCGCAAAAGGCAGAATGGGCGAAACTCGTTCACGAGGCGACCGTGACGGCTGA 

YEAQKAEWAKLVHEATVTAD 
2161  CGGCATTCATGTCGAGCTGGCTGCCAACATCGGCACGCCAGACGATGTGAAAGGAGCGTT 

GIHVELAANIGTPDDVKGAL 
2221  GGCGAACGGAGCGGAAGGAATCGGATTGTATCGCACAGAATTTCTATACATGGGACGCTC 

ANGAEGIGLYRTEFLYMGRS 
2281  GGAACTGCCGACGGAAGACGAACAGTTTGTGGCTTACAAAACGGTGCTGGAACAAATGAA 

ELPTEDEQFVAYKTVLEQMN 
2341  TGGCAAGCCGGTCGTTGTGCGGACGCTTGACATTGGCGGCGACAAAGAGCTGCCGTATTT 

GKPVVVRTLDIGGDKELPYL 
24  01  GCAGCTGCCGAAAGAAATGAATCCGTTCTTAGGGTTCCGGGCAATCCGCCTTTGCCTTGA 

QLPKEMNPFLGFRAIRLCLE 
2461  GATGCAAGATATGTTCCGCACGCAGCTGCGCGCCTTGTTGCGGGCGAGTGTGTACGGCAA 

MQDMFRTQLRALLRASVYGN 
2521  TTTGAAAATCATGTTCCCGATGATTGCGACGCTCGATGAATTCCGCCAAGCGAAAGCCAT 

LKIMFPMIATLDEFRQAKAI 
2581  TTTGCTTGAAGAAAAAGAGGCGCTCCTCCGCCAAGGGGTTGCCGTTGCGGATGGAATTGA 

LLEEKEALLRQGVAVADGIE 
2641  AGTCGGCATGATGGTGGAAATTCCGGCTGCCGCCGTCATGGCGGACCAGTTTGCGAAAGA 

VGMMVEIPAAAVMADQFAKE 
2701  AGTCGATTTCTTCAGCATTGGAACGAACGACCTGATCCAATATACGATGGCGGCCGATCG 

VDFFSIGTNDLIQYTMAADR 
2761  GATGAATGAGCGGGTTGCGTATTTGTACCAGCCGTACAACCCGGCTATTTTGCGGCTCAT 

MNERVAYLYQPYNPAILRLI 
2821  CAGCCACGTGATCGATGCCGCCCATCGCGAAGGAAAATGGGTCGGGATGTGCGGGGAAAT 

SHVIDAAHREGKWVGMCGEM 
2881  GGCCGGCGACCCGATCGCGATTCCGATTTTGCTTGCTTTAGGCCTTGATGAGTTCAGCAT 

AGDPIAIPILLALGLDEFSM 
2941  GAGCGCCACTTCGATTTTGCCGGCGCGCGCCCAGTTAAAAAAGCTGGCGAAAGAAGAGGC 

SATSILPARAQLKKLAKEEA 
3001  GGCCCGCATCAAAGAGACGGTGCTGTCGCTCGGCACGGCTGAGGAAGTCGTGTCGTTTGT 

ARIKETVLSLGTAEEVVSFV 
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3061  CAAACGAACATTTTCACTTGCCTGAATGAAAGCGTGGCGACGTTTACGTTCGCCGCGCTT 

KRTFSLA*MKAWRRLRSPRF 

PtsT  ■* 
3121  TTTTCGTTTTGCGTTATGCCAACAGCGGCGTCGACGATTCATAGCCATACCGCTGCCGGA 

FRFALCQQRRRRFIAIPLPD 
3181  CCGTAAGCGAAAGGCCGATAGCGATAACTTTCGCCATTTGGTAAACAATGTGCAAATTCG 

RKRKADSDNFRHLVNNVQIR 
3241  TATACGGAAGCACGCTCATATCAGCCGTGTCCTCCATCATGACTCCCATGACGCTGATGT 

IRKHAHISRVLHHDSHDADV 
3301  CTCCGACCGGTGGCAGCGATTTGCCGAGCGCCGTGCCAGGGGAAAGGGCGCCAGGGCGGA 

SDRWQRFAERRARGKGARAD 
3  3  61  TGGCGATCGTATGGACAAACGGCTGCGGCCCGACAATGCTGTCGATTGCCACTACATATG 

GDRMDKRLRPDNAVDCHYIC 
3421  CTCCGTGTTCGTTTTGCAATTGTTCGGAAACGCGTCGGATGTTTGTCGCATCGACGGGTT 

SVFVLQLFGNASDVCRIDGL 
3481  GCCTGAGCGTGCCGATGACGGTCAAATGATCCGGATAAGCGTTTTCCAGCAGCGTGCCGA 

PERADDGQMIRISVFQQRAD 
3541  CGAACGGCCCTAAGCTGTCGCCGTTGATGCGATTGCTGCCGATGCAAACGACGGTCAGAT 

E  R  P  * 
3  601  GACGGATGTGCGCCGGCAGCAAGGAAAACAGTTCATTGCGAATGAGCAAAGGAGCGAGCG 
3661  GATCGCTGTAGGAAATCGTCCTCATCAATCGCGATTCCTTTCTGTCTATAGTCTCCTTTT 
3721  CTTATTCGCGGCCGCTGCCGGCGATTCCTTTCTTTTCAGGGTTGCAAATTTTTCCTTATG 
3781  TTTGCTTAAACGGGTTGCGCCATTCCATTAACACGGCATGGTTGAGCGATTCTTCGTCTT 
3841  CTAAACTTTCTTGCTGCACGCGGATGAGATCAGCAAAGTCGTCTGTCGTATAGTTGCGGA 
3901  TGACCGCTTCGCGCGGGGTGTTGCCGTCAAAGACGTATAAACGTTTATCGTACATGCTGA 
3961  TCCCCCTTTCGTTCGTTTTACCCTCATCATAACCGAAACCTAGAAAAAGCGCCAAGAAAG 
4021  GTCCATGGACAAGTTTGCCCTCGACCCGTCCATGGACAAGTTTGCCCTCGACCCGGGGAT 
4081  TCACGGAAAGCGGGTAGCTTGGTGTTCAAATAAAACGATCATTTACAGTCTTACGCTAAG 
4141  TGATCAAGATCACCATCGCAAGCAACCACTGGCAATCTTCGTTTAGGACTCTTTCTCCAT 
4201  TTCACATACAAATAGGTTTACTCGCAAAAGCTTCCGAAGAAAATGAAAAACCATGTCGAA 
42  61  CTTTGTCTGTCGAAAATGATGGATGGCAGGGAGCAGGAAAGATACAATTTGATTGTAAAA 
4321  GTAATGGAAATCGGGAAGGAGAGCATATGAGAAACATTCGTTTGTGATGATGGTTGGTTT 
4381  GGATGGTCGTATGGGCTTTAGTGATGCCAAGCGTCCCGACAAAGGGACTTGCCGAAACGA 
MVVWALVMPSVPTKGLAETI 
WapA'  ■* 

4441  TCCGATCAGGGATTGACCGTGTAGAGCCAAAAGAGGAGCCGCCAAAGGTGCCGCAAGCCC 

RSGIDRVEPKEEPPKVPQAP 
4501  CGAAACGTGATCTCAAGCCAGGGGAAATCATAGAAGAAAGAACTGAGAATACAAAGGTTT 

KRDLKPGEIIEERTENTKVY 
4561  ACTACAACGGCGATGGCACGTTTACCAAGAAGATTTACTTTGAGCCTATTCATGTGAAGA 

YNGDGTFTKKIYFEPIHVKK 
4621  AAAAGGGACAAAAAATATTTGAGGAAGTATCTTCTAGCCTAACGGATAGCACCAATAATA 

KGQKIFEEVSSSLTDSTNNT 
4681  CGAATTACGTTGAAACAGAGAATACCATTTTGGAGACAAACTTTTATAAAAAG  ATGGTCG 

NYVETENTILETNFYKKMVD 
4741  ATGGAGAATACGCCAATTTCCATTACAATGGATATTCGATTTCCTATTCTATTTTAGAGG 

GEYANFHYNGYSISYSILEA 
4801  CGGCAGGTAATGACGTACAATCGATAAAAGCGAAGGATGTTGCCGCCGTCTATAAGAAAA 

AGNDVQSIKAKDVAAVYKKK 
48  61  AAGACAACAAAATTCTTCACAAAAACATTTTTCCGAGCATAGACCTACAAAATATTACAT 

DNKILHKNIFPSIDLQNITF 
4921  TTAACGAGTCAACAAAAGAAGATTTGGTTCTTCATTCGGTTGGCTATCATATTTTCAAAT 
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NESTKEDLVLHSVGYHIFKF 
4981  TCCGCCTGAAAACGGATTTGCAAGCAGACATTCAAGACGATGGCTCCATTTTATTGACAA 

RLKTDLQADIQDDGSILLTN 
5041  ATCAAGAACATGAAAAGGTGTTTGAACTGCCGAAGCCGTTTATGGTGGATTCGAATGTCG 

QEHEKVFELPKPFMVDSNVD 
5101  ATGAACACTCTGGTGAAGTGCAGCGTCTCAGAGAATGTAACGTATGAATTACAACAGGAT 

EHSGEVQRLRECNV* 
5161  ATCAAGGATATATTTTAACGGTCAAAGCCGACCCTGATTGGCTGAAGGATC 
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1993) .  Although  stem  loop  structures  can  be  predicted  between  ptsG 
and  ptsH  and  between  ptsT  and  wapA' ,  none  resembles  a  rho- 
independent  terminator. 

In  the  anti-sense  strand,  6  ORFs  were  found  which  ranged 
from  140  to  317  codons  in  length.  Three  were  very  interesting. 
The  largest  ORF  extended  from  base  434  to  1381,  spanning  from  ptsl 
to  ptsG' .  Part  of  the  translated  amino  acid  sequence  for  this  ORF 
is  homologous  to  B.  subtilis  protein  K  (Gonzy-Treboul  et  al.,  1989) 
which  is  also  located  on  the  complementary  strand  of  ptsH  and  has 
been  expressed  in  vivo.  Another  interesting  ORF  extends  from  base 
1270  to  2052,  covering  part  of  ptsl  and  3'  end  of  ptsH  (262 
codons) .  This  ORF  contains  a  high  proportion  of  serine  codons 
(19.2%).  The  translated  sequence  is  similar  to  serine-rich 
proteins  such  as  the  o-agglutinin  core  subunit  from  Saccharomyces 
cerevisiae  (Roy  et  al.,  1991).  The  third  ORF  contains  184  codons 
and  extends  from  base  3134  to  3685,  covering  part  of  ptsT  and  the 
downstream  region.  Part  of  this  translated  sequence  is  homologous 
to  hypothetical  protein  7  from  B.  subtilis  (Ogasawara  and 
Yoshikawa,  1992) .  No  highly  conserved  ribosomal-binding  sites  were 
found  immediately  upstream  from  these  three  ORFs. 
The  ptsHIT   Operon 

The  three  complete  ORFs  in  the  central  region  of  the  cloned 
fragment  appear  to  comprise  an  operon,  ptsHIT.  A  putative  promoter 
is  located  36  bases  upstream  of  the  start  codon  for  ptsH.  This 
region  resembles  the  consensus  sequence  for  B.  subtilis  a* ,  a 
promoter   which   is   primarily   associated   with   constitutive 
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housekeeping  genes  (Moran,  1993) .  The  sequence  11  bp  upstream  from 
the  ptsH  coding  region  closely  resembles  the  3 '  end  of  16s 
ribosomal  RNA  from  B.  stearothermophilus  (Van  Charldorp  et  al., 
1981)  and  is  nearly  identical  to  the  proposed  ribosomal-binding 
sites  for  two  PTS  genes  from  B.  sujbtilis,  ptsH  (Gonzy-Treboul  et 
al.,  1989)  and  bgl  (Murphy  et  al.,  1984).  A  potential  ribosomal- 
binding  site  for  ptsl  is  located  within  the  carboxyl-terminus  of 
ptsH.  The  ATG  start  codon  for  ptsl  overlaps  the  TAA  stop  codon  for 
ptsH  as  had  been  previously  reported  for  the  ptsHI  operon  in  B. 
subtilis  (Gonzy-Treboul,  1989) .  A  third  ORF  designated  ptsT  begins 
immediately  adjacent  to  the  stop  codon  of  ptsl.  No  ribosomal- 
binding  site  was  readily  identified  for  ptsT. 

The  ptsH  gene  encodes  a  peptide  containing  88  amino  acids  with 
a  calculated  Mr  of  9,166.  This  translated  sequence  is  more  similar 
to  HPrs  from  other  Gram  positive  bacteria  (70.5%  to  64.8%  identity) 
than  to  HPrs  from  Gram  negative  bacteria  (42.0%  to  31.8%)  (Table 
12)  .  Three  highly  conserved  regions  were  found  which  are  identical 
in  all  Gram  positive  HPrs  in  the  current  GenBank  data  base  (Fig. 
12A)  .  The  first  spans  Gly13  to  Thr20  and  includes  His15  which  serves 
as  the  phosphoryl  acceptor  for  EI  (Weigel  et  al.,  1982).  The 
second  extends  from  Leu22  to  Lys28.  The  third  contains  13  amino 
acids  (Val42  to  Gly")  and  includes  Ser46,  a  site  which  can  be 
phosphorylated  by  an  ATP-dependent  kinase  in  other  Gram  positive 
bacteria  (Postma  et  al.,  1993). 

Patterns  of  codon  usage  are  remarkably  different  for  B. 
stearothermophilus   and  B.    subtilis   HPrs  although  the  amino  acid 
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Table  12 .  Comparison  of  predicted  amino  acid  sequences  of  ptsH   and 
ptsl   from  B.   stearothermophilus   to  those  from  other  bacteria 


Proteins 


%  Identity  f%  Similarity) 
Bst-HPr      Bst-EI 


Accession 
Number8 


HPr 

Gram  positive  organisms 


B.    subtil  is 

E.    faecalis 

i 

S.   mutans 

i 

S.    salivarius 

i 

S .   carnosus 

i 

S.   aureus 

i 

Gram  neaative 

oraanisms 

M.  capricolum 

A.   eutrophus 

E.   coli 

S.    typhimurium 

K.    pneumoniae 

Enzyme  I 

Gram  positive 

orcramsms 

B.   subtilis 

S .    carnosus 

S.   mutans 

S.    salivarius 

Gram  neaative 

oraanisms 

70.5(84.1) 
68.2(79.5) 
67.8(81.6) 
65.5(79.3) 
64.8(80.7) 
62.1(78.2) 


42.0(60.2) 
36.8(59.8) 
32.9(48.2) 
32.9(48.2) 
31.8(48.2) 


S.  typhimurium 

E.   coli 

A.    eutrophus 


76.5(87.4) 
63.7(77.0) 
63.7(76.8) 
63.5(76.4) 


X12832 
A25053 
L15191 
Z17217 
X60766 
P02907 


L22432 
M69036 
J02796 
M21450 
X51452 


M98359 
M69050 
L15191 
M81756 


50.3(69.8)  M76176 
49.9(69.5)  J02796 
40.0(61.6)    M69036 


GenBank  accession  number 


Figure  12 .  Comparison  of  deduced  amino  acid  sequences  of 
ptsH  and  ptsl  from  B.  stearothermophilus  and  from  other 
Gram  positive  bacteria.  Regions  identical  in  HPrs  from 
4  or  more  organisms  are  indicated  by  shadowing.  Regions 
identical  in  Els  from  3  or  more  organisms  are  indicated 
by  shadowing.  Gaps  (dashes)  have  been  introduced  to 
optimize  alignment. 
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A.  HPr 

Bst  MAEKTFKWSDSGIHARPATILVQTASKFNSElQLEYNGKTVSl^SIMGVMSLGIPKeftT 
Bsu  MAQKTFKVTADSGIHARPATVLVQTASKYDADVNLEYNGKTVHLKSIMGVMS1XIAKGAE 
Efa  MEKKEFHIVAETGIHARPATIXVQTASKFNSDINLEYKGKSVNLKSIMGVMSLGVGQGSD 
Smu  MASKDFHIVAETGIHARPATLLVQTASKFASDITLDYKGKAVNtKSIMGVMSLGVGQGAD 
Ssa  MASKDFHIVAETGIHARPATLLVQI'ASKFASDITLDYKGKAVNLKSIMGVMSLGVGQGAD 
Sea  MEQQSYTIIDETGIHARPATHWQTASKFDSDIQLEYNGKKVHLKSIMGVMSLGVGKDAE 
Sail  MEQNSYVIIDETGIHARPATMLVQTASKFDSDIQLEYNGKKVKLKSIMGVMSLGVGKDAE 

Bst  IKITAEGADAAEAMAALTDTLAKEGIiAE- 
Bsu  ITISASGADENDALNALEETMKSEGLGE- 
Efa  VTITVDGADEAEGMAAIVETLQKEGLAEQ 
Smu  VTITAEGADADDAIAAINETMTKEGIA-- 
Ssa  VTISAEGADADDAIVAIAETMTKEGLA — 
Sea  ITIYADGSDEADAIQAITDVLSKEGLTI- 
Sau    ITIYADGSDESDAJQAiSDVLSKEGLT— 


B.  EI 

Bst   MGNAIREKTIHGIAASSGIAIAKAYRLETPDLAAEKRTVADVEAEIARLEAAVAKAKEEL 

BSU  M QELKGIGASAGIAIAKAYRtEEPDLTVEKKNISDSEAEVSRFDEAIARSKEEL 

Sea   MA KQIKGIAASDGVAIAKAYLLVEPDLSFDNESVTDTDAEVAKFNGALNKSKVEL 

Smu  MTEM LKGIAASDGVAVAKAYLLVQPDLTFETVSVTDTQAEEARLDAALEASQNEL 

Ssa   MTEM LKGIAASDGVAVAKAYLI.VQPDLSFETVTVEDTSAEEARLDAAIAASQDEL 

Bst    EAIKQHALEKLGEDKAAIFAAHLLVLDDPELLNPIKEKiQTERVWAEYACDETALFFISM 

Bsu  EKIKEHALKELGQ — — PVKEKISTDSVNAEFALKETSSMFVTM 

Sea  TKIRNNAEKQLGADKAAIFDAHLLVLEDPELIQPIEDKIKNESVSAAQAliTDVSNQFITI 
Smu  SLIRQKAVDTIGEEAAAVFISAHLMVLADPEMIGQIKEtlRTKEVSAESAlKEVTDMFVfL 
Ssa   SVIREKAVESLGEEAAAVFDAHLMVIADPEMTGQIKETIRAKQVNAEAAtTEVTDMFIAI 

Bst  FEAMDNE-YMKERA-ADIRDVTKRVIAHIil.GVTISNPSLISEEWIIAEDLTPSDTAQUI 

Bsu  FESMDNE-YMKERA-ADIRDVTKRVTGHLLGVEIPNPSMISEEVIIVAEDLTPSDTAQLN 

Sea  FESMDNE-YIAERAKADIRDVSKRVLAHILGVELPNPSIVDESWIIGNDLTPSDTAQIiN 

Smu  FENMEDNPYMQERA-AD1RDVAKRVIAHLLGVELPNPATISEESIVIAHDLTPSDTAQLD 

Ssa  FEGMEDNPYMQERA-ADIRDVtKRVLANrjDGKra,PNPATINEESIWAHDLTPSDTAQtD 

Bst  RQYVKGFATDIGGRTSHSAIMARSLEIPAWGTKTVTAEVKNGDIVIVDGLDGQVIINPS 
Bsu  REFVKGFTTDIGGRTSHSAIMARSLEIPAWGTKAATGTIQNGVTVIVDGINGDVIIDPS 
Sea  KEYVQGFVTNIGGRTSHSAIMSRSLEIPAWGTKSITEEVEAGDTIWDGMTGDVLINPS 
Smu  ANYVKAFVTNIGGRTSHSAIMARTLEIAAVLGTNDITERVKNGDIVAVNGITGQVIINPT 
Ssa   KKYVKAFVTNIGGRTSHSAIMARTLEIAAVLGTNNITELVKDGDILAVSGITGEWINPT 

Bst  AELLAQYEQKRARYEAQKAEWAKLVHEATVTADGIHVELAANIGTPDOVKGALAMGAEGI 
Bsu  AETVKEYEEKHNAYLAQKAEWAKLVNEPTVSKDGHHVELAANIGTPDDVKGVLEHGGEAV 
Sea  DEVIAEYQEKRENFFKDKQELQKLRDAESVTADGHHVELAANIGTPNDLPGVI ENGAEGI 
Smu  EDQIAEFKAAGETYAKQKAEWALLKDAETVTADGKHFELAANIGTPKDVEGVNNNGAEAV 
Ssa    EEQ1AEFKAAGEAYAKQKAEWALLKDAQTVTADGKHFELAANIGTPKDVEGVKDNGAEAV 
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Bst  GLYM'EFLVMGRSELPfEbEQFVAVlCrVLEQMNGKPVVVRTLDIGGDKELPYLQLPKEMN 

Bsu  GLYRTEFLYMGRDQLP^DEQFDAyKTVl^I^EGKSVWRTLDIGGDKELPytQLPKEMN 

Sea  GLYRTEFLYMGRDQMPTEEEQFEAYKAVLEAMKGKRVWRTLDIGGDKELPYLDLPEEMN 

Smu  GLYRTEFLYMDSQDFPTEDEQYEAYKAVtEGMNGiaPVWRTMDIGGDKELPYFDUPKEMN 

Ssa  GLYRTEFLYMDSQDFPTEDDQYEAYKAVLEGMNGKPWVRTMDlGGDKELPYFDLPKEMN 

Bst  PFLGFRAIRLCLEM — QDMFRTQLRALLRASVYGNLKIMFPMIATLDEFRQAKAILLEEK 

Bsu  PFLGYRAIRLCLEE— QEIFRTQLRAIiLRASTYGNLKIMFPMIATVNEFKEAKAILLEEK 

Sea  PFLGYRAIRLCLDQ — PEIFRPQLRALIRASVFGKLNIMFPMVATIQEFRDAKAFLKKNV 

Smu  PFLGFRALRI S I SETGNQMFRTQLRAtLRASVHGQtRlMFPMVALLNEFRKAKGILEEEK 

Ssa  PFLGYRALRISISETGNQMFRTQIJRALLRASVHGKLRIMFPMVALLTEFRTAKGILEEEK 

Bst  EALLRQGVAVADGIEVGMMVEIPAAAVMADQFAKEVDFFSXGTNDLIQYTMAADRMNERV 
Bsu  EKLVKAGQAVSDDIEVGMMVEIPSTAVIADQFAKEVDFFSiGTNDLIQYTMAADRMKERV 
Sea  LTLKMKAMKVADDIELGIMVEIPSTAALADIFAKEVDFFSIGTNDLIQYTMAADRMSERV 
Smu  ANLKAEGVAVSDDIQVGIHIEIPAAAM1ADQFAKEVDFFSIGTNDLIQYTMAA0RMNEQV 
Ssa  AKLVAEGVAVADDIEVGIMIEIPAAAMLADQFAKEVDFFSIGTNDLIQYTMAADRMNEQV 

Bst  AYLYQPYNPAILRLISHVIDAAHREGKWVGMCGEMAGDPIAIPIIiALGLDEFSMSATSI 
Bsu  SYLYQPYNPAILRLITLVIEAAHKEGKWVGMCGEMAGDEIAIPILLGI^LDEFSMSATSI 
Sea  SYLYQPYI SNFSFLVKQVIEASHAEGKWTGMCGEMAGDQTAIPLLLGLGLDEFSMSATS I 
Smu  SYLYQPYNPSIURLVDHWKAAHAEGKWAGMCGEMAGDQTAVPI.IjVGIGLDEFSMSATSV 
Ssa    SYLYQPYNPSILRLINNVIKAAHAEGKWAGMCGEMAGDQTAVPI,LVGMGLDEFSMSATSV 

Bst  LPARAQLKKLAKEEAARIKETVtS-LGTAEEVVSFVKRTFSLA 
Bsu  LPARTQISKLSKQEAESFKEKILS-MSTTEEWAFVKETFK — 
Sea  LKARVLIRSLNESEMKELSERAVQ-CATSEEWDLVEEYTKNA 
Smu  LRTRSLMKKLDTAKMQEIAQRAkTECATMEEVLEtEKEYIDFD 
Ssa    LRTRSLMKKLDTAKMEEYANRALTECSTMEEVLELSKEYVNVD 
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sequences  are  very  similar  (70%  identity) .  Of  28  identical  amino 
acids  in  the  three  perfectly  conserved  regions,  15  are 
translated  from  different  codons  resulting  from  17  base  changes. 
The  replacement  of  A  and  T  in  B.  subtilis  with  G  or  C  in  B. 
stearothermophilus  accounts  for  82%  of  these  changes.  However, 
base  changes  for  the  remaining  aligned  amino  acids  appear  to  be 
quite  random  with  a  48%  frequency  of  replacement  of  A  and  T  for  G 
and  C  (Fig.  13) . 

The  ptsl  gene  encodes  578  amino  acids  (Mr  63,470).  Although 
this  gene  encodes  a  protein  which  is  very  similar  to  enzyme  I  from 
B.  subtilis  (Table  12  and  Fig.  12B) ,  the  translated  sequence  for 
B.  stearothermophilus  ptsl  contains  considerably  less  serine  and 
threonine  with  correspondingly  higher  levels  of  alanine  and 
leucine.  As  with  ptsH,  the  translated  amino  acid  sequence  of  ptsJ 
from  B.  stearothermophilus  is  more  similar  to  Gram  positive 
bacteria  (7  6.5%  to  63.5%  identity)  than  to  Gram  negative  bacteria 

(50.3%  to  40.0%  identity)  (Table  12).  Highly  conserved  regions  in 
EI  are  scattered  within  the  middle  of  the  polypeptide,  the  longest 
being  25  amino  acids.  Both  amino-  and  carboxyl -terminal  regions 
are  less  conserved  (Fig.  12B)  .  It  is  interesting  to  note  that  the 
B.  stearothermophilus  EI  also  shares  partial  homology  with 
pyruvate  phosphate  dikinases  from  eukaryotes  and  prokaryotes 

(Bruchhaus  and  Tannich,  1993;  Pocalyko  et  al.,  1990)  and  with  E. 
coli   PEP  synthase  (Niersbach  et  al.,  1992). 

The  ptsT   gene  potentially  encodes  155  amino  acids.   No 
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significant  matches  to  the  translated  sequence  for  this  protein 
were  found  in  the  current  GenBank  data  base. 

General  characteristics  of  the  ptsH,  ptsl ,  and  ptsT  products 
are  listed  in  Table  13.  The  isoelectric  points  for  HPr  and  EI 
were  5.9  and  4.9,  respectively,  higher  than  reported  for  other 
Gram  positive  organisms  which  had  pi  values  ranging  from  pH  4 . 15 
to  pH  4.6  for  HPrs  and  from  pH  4.40  to  pH  4.63  for  Els.  Both  B. 
stearothermophilus  HPr  and  EI  had  fewer  negatively  charged  amino 
acid  residues.  The  predicted  pi  for  the  ptsT  product  was 
surprisingly  high,  pH  11.7.  The  translated  ptsT  gene  contained  a 
high  proportion  of  basic  amino  acids  (27%) ,  Arg  alone  comprising 
19%. 
Temperature-Induced  Hyperexpression  of  ptsHI   in  E.   coli 

Expression  of  the  ptsHIT  operon  in  E.  coli  JLT2  (ptsl  mutant) 
was  examined  in  cells  grown  in  Luria  broth  at  30CC  and  37°C  using 
denaturing  gels  (Fig.  14) .  Two  new  protein  bands  were  observed  in 
separated  extracts  of  JLT2 (pLOI1801)  and  JLT2 (pLOI1804)  which 
correspond  in  size  to  HPr  and  EI,  apparent  Mr  of  10,000  and 
64,000,  respectively  (Fig.  14).  No  new  bands  were  observed  which 
corresponded  to  the  pstT  although  expression  at  a  modest  level 
could  easily  be  obscured  by  native  proteins  of  similar  size.  With 
both  pLOI1801  and  pLOI1804,  HPr  and  EI  levels  were  estimated  to  be 
at  least  5-fold  higher  in  cells  grown  at  37°C  than  in  cells  grown 
at  30°C. 
Polar  Mutation  in  ptsH 

A  SacI  linker  (pCGAGCTCG)  was  inserted  into  plasmid  pLOI1806 


119 


I 

01 
•C 
■U 

o 

in 
m 


C 

o 

u 
(11 
a 
o 


■o 
5. 


0) 

■p 
o 

0 

M 

10 
0 

U 


10 
Eh 


0) 

£ 


O  T3 


O 

o 


0) 

a 

■o 

O  -P 

o 

•   0 
O  H 

z  o 

3 
C 


o 
M 

ft, 


oi  o\  r- 

in  ^h 


Hnifl 

I    CM  H 

I     + 


t£ 

o 

CO 

ID 

f> 

CM 

H 

1 

«J 

01 

n 

CO 

^D 

H 

CO  CO  ID 

co  r-  in 
in  h 


r~  r~  CO 
ornu 
cm  r-  t 


Eh 
U  w 
0.  H  H 
X  W  O, 


to  o  F  a  t=  i 

Ov  3  3  (0  _  <a 

•  rt  a      o  ti 

(»  a  3  Hi  fj 

°l  H"      °  —  • 
P3  h  rfO 
1-3  fl>   3   H 

w  (o  rt  a  9.  " 

*j  ft  fi  O  • 

U>         ft        ff  SI 

£•0    P.  |  » 

•  3  3  h  h-  i 

01         «   3   O 

■8  9  2.5.    fi 

tr1  C  ft  &  hK 
M  ft        „   3   H 

H  6  «  a  ■  B 
o  ro  o  — .  rt 
SB  ■{SB 

,.  Ifl    O    ■    O    Hi 

H   (0   „   01   H 

*.  O   H  3   H-  O 

£  3  01  ft  H- 

io  ^ 

B>  ifl   ft  3   f 
Cj  rt  H  u  (0  Hi 

M         O    H  IO 

S  "  *   go 

»  3  o  o  5  w 

Ot  01  m  3 
-J  01  PI  H-  rt- 
w  "  3   rt  C  01 

a  f-  oi  • 
o  a 
i-1  H  3  a 

«>  P>   0!         > 

ai  3       pj  »o 

-  (0   O   01  "O 
01   Mi        H 

B      _  ■  o 

C  Ul  3!  H-  !< 
HI         >0  N    H- 

m  rt  n  o  3 

-^3"  pi 

■O  H  pi  3  rt 

G  O   3   B>  (D 

O  C   tt  H  M 

MffWIH 

—  H  H    W 
-.    CO  •      01    Ol 


121 


3 


i 
V 


♦«!«♦ 


8       9* 


122 

(Fig.  9B)  at  the  Hpal    site  of  ptsH    to  create  a  frame-shift 

mutation  (pLOI1807) .    This  frame-shift  mutation  replaced  the 

carboxyl-terminal  12  amino  acids  with  a  new  45  amino  acid  segment. 

After  transformation  into  1101  (ptsH    mutant)  and  MM6  (ptsl 

mutant) ,  recombinants  were  tested  on  carbohydrate-MacConkey  agar 

plates.     Strain   1101(pLOI1807)   was   negative,   as   expected. 

MM6(pLOI1807)  exhibited  weak  enzyme  I  activity  at  37°C  (light  pink 

colonies)  but  was  negative  at  30°C  indicating  that  functional 

expression  of  ptsJ  was  greatly  reduced  or  eliminated  by  the 

upstream  frame-shift  mutation  in  ptsH.     However,  no  protein  bands 

corresponding  to  HPr  or  EI  were  observed  following  SDS-PAGE  of 

extracts  from  recombinants  containing  pLOI1807  (Fig.  14) . 

Growth  Inhibition  of  E.   coli  ptsl  Mutants  by  B.  stearothermophilus 
ptsHIT   Operon 

In  the  initial  screening,  toxicity  problems  associated  with 

the  B.     stearothermophilus    DNA  containing  ptsHIT    prevented  the 

recovery  of  positive  clones  by  complementation  of  an  E.   coli  ptsl 

mutant  (strain  MM6)  at  37°C  but  not  at  30°C.   MM6  recombinants 

(pLOI1800,  pLOI1801)  harboring  this  operon  grew  well  on  PTS  sugars 

at  30°C  but  formed  small  red  colonies  at  37°C.   Although  DH5a 

(ptsl*)    was  not  inhibited  by  the  presence  of  plasmids  containing 

ptsHIT   at  either  temperature,  two  additional  E.   coli  ptsl   mutants 

(JLT2  and  BL21)  were  also  inhibited  by  this  operon  at  37°C. 

Toxicity  of  the  ptsHIT   operon  at  37°C  in  E.   coli  ptsl   mutants  was 

not  limited  to  a  particular  medium  and  did  not  require  the 

addition  of  carbohydrate  since  similarly  poor  growth  was  observed 

with  Luria  agar  lacking  added  sugar.  This  toxicity  was  eliminated 
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by  insertion  of  a  frame  shift  mutation  in  ptsH    (pLOI1807)  which 

also  decreased  the  expression  of  ptsl   (Fig.  14) .  Toxicity  of  this 

operon  was  also  eliminated  by  deleting  the  ptsT  gene  (pLOI1803  and 

pLOI1804)  without  affecting  the  ability  of  recombinants  to  produce 

high  levels  of  functional  EI  and  HPr  at  30°C  and  37°C  (Fig.  14)  . 

Construction  and  testing  of  plasmid  PLOI1808  containing  the 
complete  cellobiose  PTS  system  from  B.  stearothermophilus  XL-65-6 

The  3.8  kbp  DNA  fragment  containing  B.  stearothermophilus  XL- 
65-6  eel  operon  was  isolated  from  pLOI902  (includes  functional 
promoter)  by  digestion  with  Mlul  and  Wsil.  This  fragment  was 
inserted  between  the  PstI  (polylinker)  and  Mlul  sites  of  pLOI1803 
which  contains  the  XL-65-6  ptsHI  genes  (Fig.  9B) .  The  resulting 
plasmid,  pLOI1808,  hydrolyzed  cellobiose  analogues  (4- 
methylumbelliferyl-B-D-glucopyranoside  and  4-methylumbelliferyl-B- 
D-cellobiopyranoside)  and  complemented  ptsH  and  ptsJ  mutations  in 
E.   coli. 

On  MacConkey  agar  plates  containing  cellobiose  and  incubated 
at  37°C,  DH5a(pLOI1808)  formed  pink  colonies  which  were  darker 
than  transformants  with  either  of  the  original  plasmids  (pLOI902 
or  pLOI1803) .  Controls  with  pUC18  alone  formed  light  yellow  zones 
due  to  an  increase  in  pH  from  amino  acid  catabolism. 

Attempts  were  made  to  improve  the  cellobiose  utilization  of 
DH5a(pLOI1808)  by  mutagenesis.  Crystals  of  nitrosoguanidine  were 
added  to  the  center  of  cellobiose-MacConkey  agar  plates  which  had 
been  spread  with  approximately  2,000  CFU  of  DH5a (pLOI1808) .  After 
incubation,  sixteen  colonies  were  recovered  which  appeared  more 
intensely  pink.  Plasmids  were  purified  from  these  colonies, 
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transformed  back  into  fresh  DH5a,  and  recombinants  tested.  A 
mixture  of  colony  types  was  obtained  with  each  plasmid 
preparation.  Most  transformants  appeared  similar  in  color  to 
MM6(pLOI1808)  while  others  were  white  indicating  loss  of  function. 
However,  all  contained  a  low  level  of  intensely  red  colonies. 
None  of  these  red  colonies  were  stable  during  subculturing  and 
none  were  able  to  grow  on  cellobiose-minimal  agar.  Further 
selection  was  attempted  in  minimal  broth  containing  cellobiose  but 
without  success. 

Discussion 

The  ptsHIT   Qperon 

To  our  knowledge,  this  is  the  first  time  that  ptsH  and  ptsX 
have  been  described  from  a  thermophilic  bacterium.  In  B. 
stearothermophilus,  these  two  genes  form  an  operon  which  includes 
a  third  gene,  ptsT.  The  three  genes  in  this  operon  may  be 
translationally  coupled  because  the  last  base  of  stop  codon  for 
ptsH  is  overlapped  by  the  first  base  of  start  codon  for  ptsJ  and 
the  stop  codon  for  ptsJ  is  adjacent  to  the  start  codon  for  ptsT. 
In  recombinant  E.  coli,  a  mutation  in  the  first  gene  dramatically 
decreased  the  expression  of  the  downstream  ptsl  consistent  with 
transcription  as  a  single  mRNA  unit  in  E.  coli  and  presumably  also 
in  B.  stearothermophilus .  Although  no  potential  rho- independent 
terminators  were  found  downstream  from  the  ptsHIT  operon,  the  long 
ORF  in  the  antisense  strand  may  serve  this  function  in  B. 
stearothermophilus.  Antisense  ORFs  at  the  ptsHI  locus  from  E. 
coli    (Levy  et  al.,  1989)  and  from  B.  subtlis    (Gonzy-Treboul  et 
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al.,  1989)  have  been  expressed.  Levy  et  al.  (1989)  proposed  that 
antisense  transcription  could  mediate  a  premature  termination  of 
transcription  at  the  ptsH-ptsI   locus  of  E.   coli. 

The  translated  sequences  for  ptsH  and  ptsj  from  B. 
stearothermophilus  shared  extensive  homology  with  the 
corresponding  gene  products  from  other  bacteria.  Though  clearly 
related,  homologues  from  gram  positive  bacteria  were  more  similar 
to  each  other  than  to  homologues  from  gram  negative  bacteria. 
Translated  amino  acid  sequences  for  B.  stearothermophilus  ptsH  and 
ptsl  were  most  like  those  of  B.  subtilis  but  contained  higher 
levels  of  alanine  and  leucine,  with  corresponding  reductions  in 
threonine  and  serine.  This  shift  in  amino  acid  composition  may  be 
related  to  thermal  tolerance  (Mozhaev  and  Martinek,  1984). 

The  translated  sequence  for  the  ptsT  gene  was  not  similar  to 
any  other  entries  in  the  current  GenBank  data  base,  although  the 
size,  high  proportion  of  basic  amino  acids  (Arg,Lys,His)  and  high 
pi  resemble  some  bacterial  histone-like  proteins  (Drlica  and 
Rouviere-Yaniv,  1987).  The  ptsT  gene  was  not  essential  for 
functional  complementation  of  E.  coli  strains  containing  mutations 
in  either  ptsH  or  ptsJ  mutations.  No  third  gene  is  present 
immediately  adjacent  to  ptsl  in  the  B.  subtilis  operon  although  a 
third  ORF  has  been  reported  much  further  downstream  (Reizer  et 
al.,  1993). 
Toxicity  Associated  with  the  ptsT   Gene 

Clones  containing  the  B.  stearothermophilus   ptsHIT   operon 
were   readily   obtained   from   gene   libraries   by   functional 
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complementation  of  an  E.  coli  ptsl  mutant  provided  the  initial 
selection  was  conducted  at  30°C  rather  than  37°C.  The  ptsT  gene 
was  identified  as  the  basis  for  toxicity  at  37°C.  Much  higher 
levels  of  the  ptsH  and  ptsl  gene  products  were  present  in 
recombinant  E.  coli  grown  at  37°C  than  at  30°C.  Deletion  of  the 
ptsT  gene  eliminated  toxicity  without  decreasing  expression  of  the 
recombinant  ptsH  and  ptsl  genes.  A  frame-shift  mutation  in  ptsH 
which  blocked  the  production  of  recombinant  HPr  also  decreased  the 
expression  of  ptsl  (downstream)  (and  presumably  ptsT)  and 
eliminated  toxicity. 

Although  the  role  of  ptsT   is  unknown,  it  is  interesting  to 
note  that  a  third  gene  of  similar  size  is  also  present  at  the  end 
of  the  operon  containing  ptsH   and  ptsl   in  E.    coli   and  Salmonella 
typhimurium,    the  err   gene  (Postma  et  al.,  1993).   The  importance 
of  the  PTS  and  the  err  gene  product  for  catabolite  repression  in 
enteric  bacteria  is  well  established  (Mitchell  et  al.,  1989; 
Postma  et  al.,  1993).    Although  the  mechanisms  differ,  PTS 
components  also  regulate  sugar  utilization  in  gram  positive 
bacteria  (Postma  et  al.,  1993).    Since  the  toxicity  of  B. 
stearothermophilus  ptsT   gene  was  limited  to  E.    coli  ptsl   mutants, 
ptsT    products  are  presumed  to  interfere  with  the  regulatory 
aspects  involving  PTS  components  in  E.    coli.       It  is  tempting  to 
speculate  that  ptsT    may  have  some  regulatory  function  in  B. 
stearothermophilus . 
Construction  of  a  Recombinant  Cellobiose  PTS 

Previous  studies  in  our  laboratory  have  attempted  to  develop 


127 
a  functional  hybrid  cellobiose  uptake  system  in  E.  coli  by 
combining  the  cellobiose-specif ic  PTS  genes  (celABCD)  from  B. 
stearothermophilus  with  the  E.  coli  general  proteins  (pstHI) . 
Although  function  could  be  demonstrated  with  fluorescent  analogues 
of  cellobiose,  the  utilization  of  cellobiose  was  poor.  To 
eliminate  the  potential  problem  of  incompatibility  between 
heterologous  transport  proteins,  we  constructed  a  plasmid  which 
included  the  full  set  of  B.  stearothermophilus  genes  needed  for 
cellobiose  transport  and  cleavage.  This  combination  of  genes 
facilitated  a  low  level  of  cellobiose  utilization  as  indicated  by 
acidification  of  cellobiose-MacConkey  plates  but  was  insufficient 
to  support  growth  on  cellobiose  in  minimal  medium.  Attempts  to 
improve  cellobiose  utilization  by  plasmid  mutagenesis  were  of 
limited  success.  Mutants  appear  to  be  selectable  but  are 
unstable.  Clearly  the  B.  stearothermophilus  HPr  and  EI  proteins 
functioned  well  with  the  native  sugar  specific  PTS  permeases  in  E. 
coli.  Thus  the  inability  of  these  recombinant  E.  coli  to  utilize 
cellobiose  effectively  may  reside  in  the  formation  of  an  active 
EIICel  complex.  it  is  possible  that  effective  assembly  of  the 
EII e  is  hindered  by  differences  in  membrane  composition  or  that 
higher  temperatures  are  reguired  for  EIICel  proper  assembly  and 
function. 


CHAPTER  6 
SUMMARY  AND  CONCLUSIONS 


Endospore-forming  thermophiles  were  readily  isolated  from 
diverse  natural  samples  by  using  ethanol  to  kill  vegetative 
cells  and  to  enrich  for  the  endospores,  using  nutrient  medium 
with  incubation  at  high  temperature.  Thermophiles  are 
abundant  in  all  samples  screened.  All  the  colonies  were 
either  Bacillus  species  or  actinomycetes.  About  one  fourth  of 
the  Bacillus  strains  isolated  by  incubating  at  55°C  could  also 
grow  at  over  65°C,  and  the  remaining  three  fourths  could  only 
grow  at  55°C  or  lower  temperature.  The  former  group  was 
identified  as  B.  stearothermophilus  and  the  latter  as  B. 
coagulans  by  using  methods  of  conventional  taxonomy.  All 
strains  isolated  from  the  65°C  incubation  temperature  could 
grow  at  over  65°C  and  were  identified  as  B. 
stearothermophilus . 

A  genomic  library  was  constructed  in  E.  coli  from  B. 
stearothermophilus  strain  XL-65-6  chromosomal  DNA.  This 
organism  exhibited  strong  activities  with  cellobiose  model 
substrates,  4-methylumbelliferyl-B-D-glucopyranoside  (MUG)  and 
4-methylumbellif eryl-B-D-cellobiopyranoside     (MUC) . 
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Recombinant  E.  coli  clones  degrading  these  model  substrates 
were  readily  isolated  and  contained  genes  encoding  the 
cellobiose-specific  permease  of  the  phosphoenolpyruvate- 
dependent  phosphotransferase  system  (PTS) .  Both  MUG  and  MUC 
activities  were  present  together  and  both  activities  were  lost 
concurrently  during  subcloning  experiments.  A  functional  E. 
coli  ptsl  gene  was  required  for  MUC  and  MUG  activities.  The 
DNA  fragment  from  B.  stearothermophilus  contained  four  open 
reading  frames  which  appear  to  form  a  PTS  eel  operon. 
Intergenic  stop  codons  for  celA,  celB,  and  celC  overlapped  the 
ribosomal-binding  sites  of  the  respective  downstream  genes. 
Frame-shift  mutations  or  deletions  in  celA,  celB,  and  celD 
were  individually  shown  to  result  in  a  loss  of  MUC  and  MUG 
activities.  Based  upon  amino  acid  sequence  homology  and 
hydropathy  plots  of  translated  sequences,  celA  and  celB  were 
identified  as  encoding  PTS  EIIB  and  EIIC,  and  celD  encoding 
PTS  EIIA.  These  translated  sequences  were  remarkably  similar 
to  their  respective  E.  coli  homologues  for  cellobiose 
transport.  No  reported  sequences  exhibited  a  high  level  of 
homology  with  the  celC  gene  product.  The  predicted  carboxy- 
terminal  region  for  celC  was  similar  to  the  corresponding 
region  of  E.  coli  celF,  a  phospho-6-glucosidase.  An 
incomplete  regulatory  gene  (celJ?)  and  proposed  promoter 
sequence  were  located  upstream  of  the  proposed  eel  operon.  A 
stem-loop  resembling  a  rho-independent  terminator  was  present 
immediately  downstream  from  celD.     These  results  indicate  that 
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B.  stearothermophilus  XL-65-6  contains  a  cellobiose-specif ic 
PTS  for  cellobiose  transport.  Similar  systems  were  also  found 
in  B.  subtilis  and  may  be  common  in  other  Gram  positive 
bacteria.  The  eel  operon  may  be  controlled  by  a  mechanism  of 
transcription  antitermination. 

Genes  encoding  two  general  proteins  of  PTS,  HPr  (ptsH) 
and  enzyme  I  (ptsl) ,  were  also  cloned  and  sequenced  from  B. 
stearothermophilus  XL-65-6.  Both  were  functionally 
overexpressed  in  E.  coli.  These  two  genes  along  with  the 
third  gene,  ptsT ,  form  an  operon,  which  is  flanked  by  ptsG 
(encoding  glucose  permease)  on  the  upstream  and  wapA  (encoding 
wall  associated  protein)  on  the  downstream.  The  ptsH  consists 
of  267  nucleotides  and  encodes  a  protein  of  88  amino  acids 
with  a  Mr  9,166;  the  ptsJ  consists  of  1737  nucleotides  and 
encodes  a  protein  of  578  amino  acids  with  a  Mr  63,470;  and  the 
ptsT  consists  of  468  nucleotides  and  encodes  a  protein  of  155 
amino  acids  with  a  Mr  18,428.  Both  HPr  and  enzyme  I  are  most 
similar  to  corresponding  enzymes  from  B.  subtilis.  The 
translated  ptsT  does  not  exhibit  significant  similarity  to  any 
genes  in  the  current  data  base.  These  three  genes  may  be 
translated  from  a  common  RBS  which  is  located  at  upstream  of 
ptsH.  A  polar  mutation  in  ptsH     almost  abolished  the 

expression  of  the  other  two  genes  in  E.  coli.  The  expression 
of  this  operon  in  E.  coli  at  37°C  is  higher  than  that  at  30°C. 
ptsT  gene  product  may  function  as  an  expression  or  activity 
enhancer,  but  its  exact  role  is  unknown.   No  rho-independent 
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terminators  were  found  in  the  regions  adjacent  to  the  ptsHIT 
operon.  A  transcription  termination  factor  may  be  involved. 
Southern  analysis  indicates  that  this  operon  exists  as  a 
single  copy  in  genome  of  B.  stearothermophilus . 

Cellobiose  utilization  by  E.  coli  recombinants  harboring 
the  B.  stearothermophilus  eel  operon  was  poor  (slight 
acidification  of  cellobiose-MacConkey  agar)  although  they  were 
easily  identified  using  chromogenic  cellobiose  analogues. 
Improved  cellobiose  utilization  was  obtained  by  constructing 
a  plasmid  containing  both  B.  stearothermophilus  eel  and  ptsHI 
operons.  However,  the  growth  of  this  recombinant  in  M9 
minimal  medium  containing  cellobiose  as  sole  carbon  source  was 
not  significant.  The  inability  to  grow  in  this  medium  may 
result  from  low  enzymatic  activities  of  the  B. 
stearothermophilus  gene  products  since  the  ptsH  and  ptsl 
products  were  abundant. 

A  general  model  for  initiating  cellobiose  utilization  in 
B.  stearothermophilus  has  been  proposed  (Fig.  15).  This 
organism  uses  PTS  for  cellobiose  uptake  from  the  environment 
and  to  initiate  its  metabolism.  Genes  encoding  the 
cellobiose-specif ic  permease  form  an  operon,  consisting  of 
four  genes,  celA,  celB,  celC,  and  celD.  CelB  is  the  major 
membrane  spanning  protein,  CelA  and  D  are  accessory  proteins. 
These  3  proteins  along  with  enzyme  I  and  HPr  form  the 
transport  system.  Cellobiose  is  transported  into  cytoplasm  by 
this   system   and   concurrently   phosphorylated.     The 
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phosphorylated  cellobiose  is  then  hydrolyzed  by  CelC  with  the 
product  entering  the  glycolytic  pathway. 
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